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TITLE 

A METHOD FOR HIGH-DENSITY MICRO ARRAY MEDIATED 

GENE EXPRESSION PROFILING 
This application claims the benefit of U.S. Provisional Application 
5 No. 60/159,898, filed October 15, 1999. 

FIELD OF THE INVENTION 
This invention is in the field of bacterial gene expression. More 
specifically, this invention is a method for the high density, microarray-mediated 
gene expression profiling of Escherichia coli for comprehensive gene expression 
10 analysis. 

BACKGROUND OF THE INVENTION 
Escherichia coli has been exhaustively studied for over 50 years. Early 
experiments measured the molecular^fluxes from small compounds into 
macromolecular constituents. These studies were followed by others in which 

15 small molecule pools of central metabolic building blocks, nucleotides and amino 
acids were determined. The levels of several macromolecular components, 
including individual species of proteins, have been measured. Such measurements 
of the steady state provide a census of the cellular content while changes upon 
imposition of a stress catalogue the cell's fight for survival. This response to an 

20 insulting or adverse condition can take many forms from relieving end product 
inhibition to derepressing transcription. 

In E. coli, experiments to define stress-related, global regulatory responses 
have often relied upon one of two approaches. In the first, operon fusions induced 
by a particular stress are isolated. In the second, proteomic measures in which the 

25 protein fractions from stressed and un-stressed cultures are separated by a 
two-dimensional method and then compared. Each method has an inherent 
technological hurdle; for the former, the map location of responsive gene fusions 
must be known precisely, and for the latter, induced or repressed proteins excised 
from the two-dimensional gels must be identified. 

30 Another method uses a transposon-mediated mutagenesis (Spector et aL J, 

Bacteriol 170:345-351 (1988)). A reporter gene is inserted at a random location 
in the genome using a transposon. By assaying for the reporter gene before and 
after the treatment, genes affected by the treatment can be mapped and cloned by 
using the linked transposon as a marker. However, this method is limited to 

35 non-essential genes. 

Alternatively, mRNA measurements utilizing techniques (such as 
hybridization to DNA and primer extension) have allowed the monitoring of 
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individual gene's expression profiles. DeRisi et al. (Science 278:680-686 (1997)) 
reported the expression profiling of most yeast genes. The measurements were 
facilitated by high-density arrays of individual genes and specific labeling of 
cDNA copies of eukaryotic mRNA using polyA tail-specific primers. The lack of 
5 a polyA tail and the extremely short bacterial mRNA half life represent hurdles for 
the application of DNA micro-array technology to prokaryotic research. 

A comprehensive expression profiling has been performed previously with 
the yeast Saccharomyces cerevisiae. Adaptation of RNA isolation and labeling 
protocols from eukaryotes to prokaryotes is not straightforward since eukaryotic 
10 mRNA manipulations often exploit 3'-polyadenylation of this molecular species. 

Chuang et al. (J. BacteroL 175:2026-2036 (1993)) reported an expression 
profiling using large DNA fragments from an ordered X library of E. coli genomic 
fragments as a capture reagent. It allowed the comparison of the expression 
patterns from large portions of DNA fragments by comparing mRNA levels from 
15 stressed and unstressed E. coli cultures. The resolution of this method, however, 
was unsatisfactory. Expression of groups of genes, as opposed to the expression 
of each individual gene was measured. Moreover, the method used radio-labeled 
DNA as a probe with the incumbent need for safety precautions. Furthermore, the 
use of radio-labeled probe prevents the simultaneous measurement of the 
20 expression level in a test sample and a control sample. 

Richmond et al. (Nucleic Acids Research, 19:3821-3835 (1999)) has 
recently reported genome- wide expression profiling of E. coli at a single ORF 
level of resolution. Changes in RNA levels after exposure to heat shock or IPTG 
were analyzed using comprehensive Bow density blbts'of individual ORFs on a 
25 nylon matrix and comprehensive high! density arrays* of individual ORFs spotted 
on glass slides. The results of the twd methods were compared. 

The methods recited above permit monitoring 'of the effect of 
environmental changes on gene expression by comparing expression levels of a 
limited number of genes. They, however, fail to monitor the comprehensive 
30 responses of a preponderaiice of individual genes in the genome of an organism in 
reliable, useful manner. ' '"i.H? • 

The problem to be solved, therefore, is to provide a way to measure the 
comprehensive gene expression profile analysis of the organism. 

SUMMARY OF THE INVENTION 
35 The invention provides a method for identifying gene expression changes 
within a bacterial species comprising: 

(a) providing a comprehensive micro-array synthesized from DNA 
comprised in a bacterial species; 

2 
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(b) generating a first set of labeled probes from bacterial RNA, the 
RNA isolated from the bacterial species of step (a); 

(c) hybridizing the first set of labeled probes of step (b) to the 
comprehensive micro-array of step (a), wherein hybridization 

5 results in a detectable signal generated from the labeled probe; 

(d) measuring the signal generated by the hybridization of the first set 
of labeled probe to the comprehensive micro-array of step (c); 

(e) subjecting the bacterial species of step (a) to a gene expression 
altering condition whereby the gene expression profile of the 

10 bacterial species is altered to produce a modified bacterial species ; 

(f) generating a second set of labeled probes from bacterial RNA, the 
RNA isolated from the modified bacterial species of step (e); 

(g) hybridizing the second set of labeled probes of step (f) to the 
comprehensive micro-array of step (a), wherein hybridization 

15 results in a detectable signal generated from the labeled probe; 

(h) measuring the signal generated by the hybridization of the second 
set of labeled probes to the comprehensive micro-array of step (g); 
and ' ... ;\; ,' i ; ; J j, 1 • > , , : « . 

(i) comparing signal generated from the first hybridization to the 
20 signal generated from the second hybridization to identify gene 

expression changes within a bacterial species. 
Additionally the invention provides a method for identifying gene 
expression changes within a bacterial strain comprising: 

(a) providing a comprehensive micro-array synthesized from DNA 
25 comprised in a bacterial species ; 

(b) generating a first set of fluorescent cDNA from bacterial RNA, the 
RNA isolated from the bacterial species of step (a); 

(c) hybridizing the first set of fluorescent cDNA of step (b) to the 
comprehensive micro-array of step (a), wherein hybridization 

30 results in a detectable signal generated from the fluorescent cDNA; 

(d) measuring the signal generated by the hybridization of the first set 
of fluorescent cDNA to the comprehensive micro-array of step (c); 

(e) subjecting the bacterial spJecies of step (a) to a gene expression 
altering f condition whereby the gene expression profile of the 

35 bacterial species is altered to produce a modified bacterial species; 

(f) generating a second set of fluorescent cDNA from bacterial RNA, 
the RNA isolated from the modified bacterial species of step (e); 
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(g) hybridizing the second set of fluorescent cDNA of step (f) to the 
comprehensive micro-array of step (a), wherein hybridization 
results in a detectable signal generated from the fluorescent cDNA; 

(h) measuring the signal generated by the hybridization of the second 
5 set of fluorescent cDNA to the comprehensive micro-array of 

step (g); and 

(i) comparing signal generated from the first hybridization to the 
signal generated from the second hybridization to identify gene 
expression changes within a bacterial species . 

10 In an alternate embodiment the invention provides a method for 

identifying gene expression changes within a genome comprising: 

(a) providing a comprehensive micro-array synthesized from DNA 
comprised in a prokaryotic or eukaryotic speices; 

(b) generating a control set of fluorescent cDNA from total or 
15 polyadenylated RNA, the RNA isolated from the species of 

step (a), the fluorescent cDNA comprising at least one first 
fluorescent label and at least one different second fluorescent label; 

(c) mixing the control set of fluorescent cDNA labeled with the at least 
one first label with the control set of fluorescent cDNA labeled 

20 with the at least second first label to for a dual labeled control 

cDNA; 

(d) hybridizing the dual labeled control set of fluorescent cDNA of 
step (c) to the comprehensive micro-array of step (a), wherein 
hybridization results in a detectable signal generated from the 

25 fluorescent cDNA; 

(e) measuring the signal generated by the hybridization of the dual 
labeled control set of fluorescent cDNA to the comprehensive 
micro-array of step (c); 'i * 

(f) subjecting the prokaryote or eukaryote of step (a) to a gene 

30 expression altering condition whereby the gene expression profile 

of the prokaryote or eukaryote is altered to produce a modified 
prokaryote or eukaryote ; 

(g) generating an experimental set of fluorescent cDNA from total or 
polyadenylated RNA, the RNA isolated from the modified 

35 prokaryote or eukaryote of step (e), the fluorescent cDNA 

comprising the first fluorescent label and the different second 
fluorescent label to step (b); 
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(h) mixing the experimental set of fluorescent cDNA labeled with the 
at least one first label with the experimental set of fluorescent 
cDNA labeled with the at least second first label to form a dual 
labeled experimental cDNA; 
5 (i) hybridizing the experimental set of fluorescent cDNA of step (h) to 

the comprehensive micro-array of step (a), wherein hybridization 
results in a detectable signal generated from the fluorescent cDNA; 

0) measuring the signal generated by the hybridization of the second 
set of fluorescent cDNA to the comprehensive micro-array of 
10 step (g); and 

(k) comparing signal generated from the dual labeled control 

hybridization with the dual labeled experimental hybridization to 
identify gene expression changes within a prokaryotic or 
eukaryotic species. 

15 In another embodiment the invention provides a method for quantitating the 
amount of protein specifying RNA contained within a genome comprising: 
(a) providing a comprehensive micro-array comprising a multiplicity 
of genes synthesized from genomic DNA comprised in a 
prokaryotic or eukaryotic organism; 
20 (b) generating a set of fluorescent cDNA from total or poly-adenylated 

RNA isolated from the prokaryotic or eukaryotic organism of 
step (a); 

(c) generating a set of fluorescent DNA from genomic DNA isolated 
from the prokaryotic or eukaryotic organism of step (a); 
25 (d) hybridizing the fluorescent cDNA of step (b) to the comprehensive 

micro-array of step (a), wherein hybridization results in a first 
fluorescent signal generated from the fluorescent cDNA for each 
gene; 

(e) hybridizing the fluorescent DNA of step (c) to the comprehensive 
30 micro-array of step (a), wherein hybridization results in a second 

fluorescent signal generated from the fluorescent DNA for each 
gene; and 

(f) dividing, for each open reading from, the first fluorescent signal 
into the second fluorescent signal to provide a quantitated measure 

35 of the amount of protein specifying RNA for each gene. 

The methods of the present invention are applicable to genomes contained 
within a variety of organisms including bacteria, cyanobacteria, yeasts, 
filamentous fungi, plant cells and animal cells. 

5 
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The present methods of identifying gene expression changes within 
genome may be additionally coupled with the methods of quantitating the amount 
of protein specifying RNA contained within a genome as disclosed herein. 
BRIEF DESCRIPTION OF THE DRAWINGS AND 
5 SEQUENCE DESCRIPTIONS. 

Figure 1 A describes the gene expression analysis of IPTG induction in a 
single hybridization experiment using different slide sets as capture reagents for 
Cy3-labeled cDNA derived from treated and control cells and plotted in log-log 
form. 

10 Figure IB describes the gene expression analysis of IPTG induction by 

labeling the control sample with Cy5 and the induced sample with Cy3 before 
hybridizing to a single set of 3 slides. 

Figure 1C describes an average of induced RNA and control RNA with 
Cy3 from IPTG induction, generated by label swapping. 
15 Figure ID describes data replicating the results shown in Figure 1C. 

Figure IE describes an averaging of the data of Figure 1C and Figure ID. 
Figure 2 describes the distribution of gene expression levels for cells 
grown in minimal or rich medium. 

Figure 3 describes the fractional (summed open reading frame 
20 transcripts/total open reading frame transcripts) analysis of gene expression. 

The invention can be more fully understood from the following detailed 
description and the accompanying sequence descriptions which form a part of this 
application. . ' 

The following sequences comply >vith 37, CFiCl .821-1.825 . 
25 ("Requirements for Patent Applications Containing Nucleotide Sequences and/or 
Amino Acid Sequence Disclosures - the Sequence Rules") and are consistent with 
World Intellectual Property 'Organization (WIPO) Standard ST.25 (1998) and the 
sequence listing requirements of the EPO and PCT (Rules 5.2 and 49.5(a-bis), and 
Section 208 and Annex C of the Administrative Instructions). The symbols and 
30 format used for nucleotide and amino acid sequence data comply with the rules set 
forth in 37 C.F.R. §1.822. 

SEQ ID NO:l and 2 are primers used in the amplification of the sdiA gene. 

DETAILED DESCRIPTION OF THE INVENTION 
Applicants have solved the stated problem by providing a method to 
35 measure a comprehensive mRNA expression of E. coli using a high density DNA 
microarray with a near-complete collection of E. coli open reading frames (ORFs). 
The present invention advances the art by providing: 
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(i) the first instance of a comprehensive micro-array comprising greater 
than 75% of all open reading frames from a prokaryotic organism, overcoming the 
problems of high concentration of endogenous RNAase and ribosomal RNA; 

(ii) a method for quantitating the amount of each protein specifying RNA 
5 contained within a culture; and 

(iii) a method for decreasing the background noise generated within a gene 
expression profile through the combination of multiple signal generating labels. 

The present invention has utility in many different fields. Many discovery 
compounds can be screened by comparing their gene expression profile to a 
10 known compound that affects the desirable target gene products. Additionally 
gene expression profiles are good indicators of genotypic alterations among 
strains. The present invention may allow the discovery of complementary target 
inhibitors in combination drug-therapy and may be used as a modeling system to 
test perturbations in process conditions to determine the conditions for the high 
15 yield of desired production in various bio-processes and biotransformations. 

In this disclosure, a number of terms and abbreviations are used. The 
following definitions are provided. 

"Open reading frame" is abbreviated ORF. The term "ORF" is refers to a 
gene that specifies a protein. ■ , 

20 "Polymerase chain reaction" is abbreviated PCR. 

The term "micro-array" means an array of regions having a density of 
discrete regions of oligonucleotides of at least about* 100/cm 2 , and preferably at 
least about 1000/cm 2 . 

The term "comprehensive micro array" refers to high-density micro-array 
25 containing at least 75% of all open reading frames of the organism. 

The term "expression profile" refers to the expression of groups of genes. 

The term "gene expression profile" refers to the expression of an 
individual gene and of suites of individual genes. 

The "comprehensive expression profile" refers to the gene expression 
30 profile of more than 75% of all genes in the genome. 

The term "high density" as used in conjunction with micro-array means 
and array having an array density of generally greater than about 60, more 
generally greater than about ltid] most generally greater than about 600, often 
greater than about 1000, more often greater than about 5,000, most often greater 
35 than about 10,000, preferably greater than about 40,000 more preferably greater 
than about 100,000, and most preferably greater than about 400,000 different 
nucleic acids per cm. 2 
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As used herein, an "isolated nucleic acid fragment" is a polymer of RNA 
or DNA that is single- or double-stranded, optionally containing synthetic, non- 
natural or altered nucleotide bases. An isolated nucleic acid fragment in the form 
of a polymer of DNA may be comprised of one or more segments of cDNA, 
5 genomic DNA or synthetic DNA. 

The term "probe" refers to a single-stranded nucleic acid molecule that can 
base pair with a complementary single stranded target nucleic acid to form a 
double-stranded molecule. ~~ ~ ~ ~ 

The term "genotype" refers to the genetic constitution of an organism as 
10 distinguished from its physical appearance. 

The term "genomic DNA" refers to total DNA from an organism. 

The term "total RNA" refers to non-fractionated RNA from an organism. 

The term "protein specifying RNA" or "protein specifying transcript" or 
"mRNA" refers to RNA derived from ORF. 
15 The term "label", will refer to a substance which may be incorporated into 

DNA or RNA which will emit a detectable signal under various conditions. 
Typically a label will be a fluorescent moiety.. , , lt . . ( , 

A nucleic acid molecule is "hybridizable" to another ^ucleic acid 
molecule, such as a cDNA, genomic DNA, or RNA, when a single stranded form 
20 of the nucleic acid molecule can anneal to the other nucleic acid molecule under 
the appropriate conditions of temperature and solution ionic strength. 
Hybridization and washing conditions are well known and exemplified in 
Sambrook, J., Fritsch, E. F. and Maniatis, T. Molecular Cloning: A Laboratory 
Manual, Second Edition, Cold Spring Harbor Laboratory Press, Cold Spring 
25 Harbor (1 989), particularly Chapter 1 1 and Table 11.1 therein. The conditions of 
temperature and ionic strength determine the "stringency" of the hybridization. 
Hybridization requires that the two nucleic acids contain complementary 
sequences, although depending on the stringency of the hybridization, mismatches 
between bases are possible. The appropriate stringency for hybridizing nucleic 
30 acids depends on the length of the nucleic acids and the degree of 

complementation, variables well known in the art. The greater the degree of 
similarity or homology between two nucleotide sequences, the greater the value of 
T/w for hybrids of nucleic acids having those sequences. The relative stability 
(corresponding to higher Tm) of nucleic acid hybridizations decreases in the 
35 following order: RNA:RNA, DNA:RNA, DNA:DNA. For hybrids of greater 
than 100 nucleotides in length, equations for calculating T/w have been derived 
(see Sambrook et al., supra, 9.50-9.51). For hybridizations with shorter nucleic 
acids, i.e., oligonucleotides,, the position of mismatches becomes more important, 

= . ' 8 
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and the length of the oligonucleotide determines its specificity (see Sambrook 
et al., supra, 1 1 .7-1 1 .8). Furthermore, the skilled artisan will recognize that the 
temperature and wash solution salt concentration may be adjusted as necessary 
according to factors such as length of the probe. 
5 The term "complementary" is used to describe the relationship between 

nucleotide bases that are capable to hybridizing to one another. For example, with 
respect to DNA, adenosine is complementary to thymine and cytosine is 
complementary to guanine. 

"Gene" refers to the part of the genome specifying a macromolecular 
10 product be it RNA or a protein and include regulatory sequences preceding 
(5' non-coding sequences) and following (3' non-coding sequences) the coding 
sequence. 

A "genetic site" refers to a genomic region at which a gene product 
operates. 

15 "Coding sequence" or "open reading frame" (ORF) refers to a DNA 

sequence that codes for a specific amino acid sequence. "Suitable regulatory 
sequences" refer to nucleotide sequences located upstream (5' non-coding 
sequences), within, or downstream (3* non-coding sequences) of a coding 
sequence, and which influence the transcription, RNA processing or stability, or 

20 translation of the associated coding sequence. Regulatory sequences may include 
promoters, translation leader sequences, introns, and polyadenylation recognition 
sequences. 

"Promoter" refers to a DNA sequence capable of controlling the 
expression of a coding sequence or functional RNA. In general, a coding 

25 sequence is located 3 ! to a promoter sequence. Promoters may be derived in their 
entirety from a native gene, or be composed of different elements derived from 
different promoters found in nature, or even comprise synthetic DNA segments. It 
is understood by those skilled in the art that different promoters may direct the 
expression of a gene in different tissues or cell types, or at different stages of 

30 development, or in response to different environmental conditions. Promoters 

which cause a gene to be expressed in most cell types at most times are commonly 

' • "» • j« .! .j! i.>-i ' i-: 

referred to as 1 constitutive promoters''. It is further recognized that since in most 

cases the exact boundaries of regulatory sequences have not been completely 

defined, DNA fragments, of different 1 lengths may have identical promoter activity. 

35 "RNA transcript" refers to the product resulting from RNA polymerase- 

catalyzed transcription of a DNA sequence. When the RNA transcript is the 

polymer product of an RNA polymerase, it is referred to as the primary transcript 

or it may be a RNA sequence derived from post-transcriptional processing of the 
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primary transcript and is referred to as the mature RNA. "Messenger RNA 
(mRNA)" refers to the RNA that is without introns and that can be translated into 
protein by the cell. "cDNA" refers to a double-stranded DNA that is 
complementary to and derived from mRNA. 
5 The term "expression", as used herein, refers to the transcription and stable 

accumulation of sense (mRNA) or antisense RNA derived from genomic DNA. 
Expression may also refer to translation of mRNA into a polypeptide. 

The term "stress" or "environmental stress" refers to the condition 
produced in a cell as the result of exposure to an environmental insult. 

10 The term "insult" or "environmental insult" refers to any substance or 

environmental change that results in an alteration of normal cellular metabolism in 
a bacterial cell or population of cells. Environmental insults may include, but are 
not limited to, chemicals, environmental pollutants, heavy metals, changes in 
temperature, changes in pH, as well as agents producing oxidative damage, DNA 

15 damage, anaerobiosis, arid changes in'riitrate availability or pathogenesis. 

The term "stress response" refers to the cellular response to an 
environmental insult. ' ' ' * 

The term "stress gene" refers to any gene whose transcription is induced as 
a result of environmental stress or by the presence of an environmental insult. 

20 The term "modified bacterial species" refers to a bacterial culture that has 

been exposed to a stress or insult such that either it demonstrates a change in its 
gene expression profile. Typically the modified bacterial species is produced as 
the result of induction or challenge of the culture with a chemical or 
environmental challenge. Similarly, a "modified prokaryotic or eukaryotic 

25 species" refers to either a prokarytoic or eukaryotic organism that has been 

exposed to a stress or insult such that the gene expression profile of that organisms 
as been altered. 

The term "log phase", "log phase growth", "exponential phase" or 
"exponential phase growth" refers to cell cultures of organisms growing under 
30 conditions permitting the exponential multiplication of the cell number. 

The term "growth-altering environment" refers to energy, chemicals, or 
living things that have the capacity to either inhibit cell growth or kill cells. 
Inhibitory agents may include but are not limited to mutagens, antibiotics, UV 
light, gamma-rays, x-rays, extreme temperature, phage, macrophages, organic 
35 chemicals and inorganic chemicals. 

Standard recombinant DNA and molecular cloning techniques used here 
are well known in the art and are described by Sambrook, J., Fritsch, E. F. and 
Maniatis, T., Molecular Cloning: A Laboratory Manual , Second Edition, Cold 
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Spring Harbor Laboratory Press, Cold Spring Harbor, NY (1989) (hereinafter 
"Maniatis"); and by Silhavy, T. J., Bennan, M. L. and Enquist, L. W., 
Experiments with Gene Fusions . Cold Spring Harbor Laboratory Cold Press 
Spring Harbor, NY (1984); and by Ausubel, F. M. et al., Current Protocols in 
5 Molecular Biology , published by Greene Publishing Assoc. and Wiley- 
Interscience(1987). 

The present invention provides a method to measure the changes in gene 
expression profiles of prokaryotic organisms. The present invention also provides 
a method to measure the levels of protein specifying RNA in prokaryotic and/or 

10 eukaryotic organisms. The present invention provides a method to compare the 
gene expression patterns of two samples differing in one variable. The variables 
may include but are not limited to genotype, media, temperature, depletion or 
addition of nutrient, addition of an inhibitor, physical assault, irradiation, heat, 
cold, elevated or lowered pressure, desiccation, low or high ionic strength, and 

15 growth phases. 

Gene expression profiles were determined under the following conditions 
to find: (a) differences in gene expression profiles caused by growth of E. coli in 
either minimal or rich medium, (b) changes in gene expression associated with the 
transition from exponential phase to stationary phase growth in minimal medium, 

20 and (c) the specificity of induction mediated by isopropylthiogalactoside (IPTG), 
the classic lac operon inducer, (d) the specificity of expression changes mediated 
by the amplification of sdiA, a positive activator of an operon that includes 
ftsQAZ, genes essential for septation, and (e) the changes in gene expression 
patterns with cells that cannot turn on^|he SOS stress response in comparison to 

25 wild type response when the cells are exposed to mitomycin C (MMC). 

* ' : ' ■ * - 4-",' i. ' ■ ; ' 

In its most basic form the present invention creates a comprehensive 
micro-array from a bacterial genome. Any bacteria is suitable for analysis by the 
method of the present invention where enteric bacteria (Escherichia, and 
Salmonella for example) as well as cyanobacteria (such as Rhodobacter and 

30 Synechocysiis and Bacillus, Acinetobacter, Streptomyces, Methylobacter, and 
Pseudomona are particularly suitable. 

One of skill in the art will appreciate that in order to measure the 
transcription level (and thereby the expression level) of a gene or genes, it is 
desirable to provide a nucleic acid sample comprising mRNA transcript(s) of the 

35 gene or genes, or nucleic acids derived from the mRNA transcript(s). As used 

herein, a nucleic acid derived from an mRNA transcript refers to a nucleic acid for 
whose synthesis the mRNA transcript or a subsequence thereof has ultimately 
served as a template. Thus, a cDNA reverse transcribed from an mRNA, an RNA 
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transcribed from that cDNA, a DNA amplified from the cDNA, an RNA 
transcribed from the amplified DNA, etc., are all derived from the mRNA 
transcript and detection of such derived products is indicative of the presence 
and/or abundance of the original transcript in a sample. Thus, suitable samples 
include, but are not limited to, mRNA transcripts of the gene or genes, cDNA 
reverse transcribed from the mRNA, cRNA transcribed from the cDNA, DNA 
amplified from the genes, RNA transcribed from amplified DNA, and the like. 

Typically the genes are amplified by methods of primer directed 
amplification such as polymerase chain reaction (PCR) (U.S . Patent 
No. 4,683,202 (1987, Mullis, et al.) and U.S. Patent No. 4,683,195 (1986, Mullis, 
et al.), ligase chain reaction ( LCR) (Tabor et al., Proc. Acad. Sci. U.S.A. , 82, 
1074-1078 (1985)) or strand displacement amplification (Walker et al., Proc. Natl 
Acad. ScL U.S.A., 89, 392, (1992) for Example. i; ^ ^ ■ 

The micro-array is comprehensive in that it incorporates' at least 75% of all 
ORFs present in the genome: 1 Amplified ORF's are theii spotted on slides 
comprised of glass or some other soliid substrate by methods well known in the art 
to form a micro-array. Methods of forming high density arrays of 
oligonucleotides, with a minimal number of synthetic steps are known (see for 
example Brown et al., U.S: Patent No. 6,1 10,426). The oligonucleotide analogue 
array can be synthesized dn ( a solid substrate by a variety of methods, including, 
but not limited to, light-directed chemical coupling, and mechanically directed 
coupling. See Pirrung et al., U.S. Pat. No. 5,143,854 (see also PCT Application 
No. WO 90/15070) and Fodor et al., PCT Publication Nos. WO 92/10092 and 
WO 93/09668 which disclose methods of forming vast arrays of peptides, 
oligonucleotides and other molecules using, for example, light-directed synthesis 
techniques. See also, Fodor et al., Science, 251, 767-77 (1991). 

Bacteria typically contain from about 2000 to about 6000 ORF's per 
genome and the present method is suitable for genomes of this size where 
genomes of about 4000 ORF's are most suitable. ] The ORF's are arrayed in high 
density on at least one glass microscope slide. This is in contrast to a low density 
array where ORF's are arrayed on a membranous material such as nitrocellulose. 
The small surface area of the high density array (often less than about 10 cm 2 , 
preferably less than about 5 cm 2 more preferably less than about 2 cm 2 , and most 
preferably less than about 1 .6 cm. 2 ) permits extremely uniform hybridization 
conditions (temperature regulation, salt content, etc.). 

Once all the genes of ORF's from the genome are amplified, isolated and 
arrayed, a set of probes, bearing a signal generating label are synthesized. Probes 
may be randomly generated or may be synthesized based on the sequence of 
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specific open reading frames. Probes of the present invention are typically single 
stranded nucleic acid sequences which are complementary to the nucleic acid 
sequences to be detected. Probes are "hybridizable" to the ORF's. The probe 
length can vary from 5 bases to tens of thousands of bases, and will depend upon 
5 the specific test to be done. Typically a probe length of about 15 bases to about 
30 bases is suitable. Only part of the probe molecule need be complementary to 
the nucleic acid sequence to be detected. In addition, the complementarity 
between the probe and the target sequence need not be perfect. Hybridization 
does occur between imperfectly complementary molecules with the result that a 

10 certain fraction of the bases in the hybridized region are not paired with the proper 
complementary base. 

Signal generating labels that may be incorporated into the probes are well 
known in the art. For example labels may include but are not limited to 
fluorescent moieties, chemiluminescent moieties, particles, enzymes, radioactive 

15 tags, or light emitting moieties or molecules, where fluorescent moieties are 
preferred. Most preferred are fluorescent dyes capable of attaching to nucleic 
acids and emitting a fluorescent signal. A variety of dyes are known in the art 
such as fluorescein, Texas red, and rhodamine. Preferred in the present invention 
are the mono reactive dyes cy3 (146368-16-3) and cy5 (146368-14-1) both 

20 available commercially (i.e.Amersham Pharmacia Biotech, Arlington Heights, 

IL). Suitable dyes are discussed in U.S. Patent No. 5,814,454 hereby incorporated 
by reference. f V < ; 

Labels may be incorporated by any of a number of means well known to 
those of skill in the art. However, in a preferred embodiment, the label is 

25 simultaneously incorporated during the amplification step in the preparation of the 
probe nucleic acids. Thus, for example, polymerase chain reaction (PCR) with 
labeled primers or labeled nucleotides will provide a labeled amplification 
product. In a preferred embodiment, reverse transcription or replication, using a 
labeled nucleotide (e.g. dye-labeled UTP and/or CTP) incorporates a label into the 

30 transcribed nucleic acids. 

Alternatively, a label may be added directly to the original nucleic acid 
sample (e.g., mRNA, polyA mRNA, cDNA, etc.) or to the amplification product 
after the synthesis is completed. Means of attaching labels to nucleic acids are 
well known to those of skill in the art and include, for example nick translation or 

35 end-labeling (e.g. with a labeled RNA) by kinasing of the nucleic acid and 
subsequent attachment (ligation) of a nucleic acid linker joining the sample 
nucleic acid to a label (elg., a fluorophore). 
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Following incorporation of the label into the probe the probes are then 
hybridized to the micro-array using standard conditions where hybridization 
results in a double stranded nucleic acid, generating a detectable signal from the 
label at the site of capture reagent attachment to the surface. Typically the probe 
5 and array must be mixed with each other under conditions which will permit 
nucleic acid hybridization. This involves contacting the probe and array in the 
presence of an inorganic or organic salt under the proper concentration and 
temperature conditions. The probe and array nucleic acids must be in contact for a 
long enough time that any possible hybridization between the probe and sample 

10 nucleic acid may occur. The concentration of probe or array in the mixture will 
determine the time necessary for hybridization to occur. The higher the probe or 
array concentration the shorter the hybridization incubation time needed. 
Optionally a chaotropic agent may be added. The chaotropic agent stabilizes 
nucleic acids by inhibiting nuclease activity. Furthermore, the chaotropic agent 

15 allows sensitive and stringent hybridization of short oligonucleotide probes at 
room temperature [Van Ness and Chen (1991) NucL Acids Res. 19:5143-5151]. 
Suitable chaotropic agents include guanidinium chloride, guanidinium 
thiocyanate, sodium thiocyanate, lithium tetrachloroacetate, sodium perchlorate, 
rubidium tetrachloroacetate, potassium iodide, and cesium trifluoroacetate, among 

20 others. Typically, the chaotropic agent will be present at a final concentration of 
about 3 M. If desired, one can add formamide to the hybridization mixture, 
typically 30-50% (v/v). 

Various hybridization solutions can be employed. Typically, these 
comprise from about 20 to 60% volume, preferably 30%, of a polar organic 

25 solvent. A common hybridization solution employs about 30-50% v/v 

formamide, about 0.15 to 1 M sodium chloride, about 0.05 to 0.1 M buffers, 
such as sodium citrate, Tris-HCl, PIPES or HEPES (pH range about 6-9), about 
0.05 to 0.2% detergent, such as sodium dodecylsulfate, or between 0.5-20 mM 
EDTA, FICOLL (Pharmacia Inc.) (about 300-500 kilodaltons), 

30 polyvinylpyrrolidone (about 250-500 kdal), and serum albumin. Also included 
in the typical hybridization solution will be unlabeled carrier nucleic acids from 
about 0.1 to 5 mg/mL, fragmented nucleic DNA, e.g., calf thymus or salmon 
sperm DNA, or yeast RNA, and optionally from about 0.5 to 2% wt./vol. 
glycine. Other additives may also be included, such as volume exclusion agents 

35 which include a variety of polar water-soluble or swellable agents, such as 
polyethylene glycol, anionic polymers such as polyacrylate or 
polymethylacrylate, and anionic saccharidic polymers, such as dextran sulfate. 
Methods of optimizing hybridization conditions are well known to those of skill 
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in the art (see, e.g., Laboratory Techniques in Biochemistry and Molecular 
Biology, Vol. 24: Hybridization With Nucleic Acid Probes, P. Tijssen, ed. 
Elsevier, N.Y., (1993)) and Maniatis, supra. 

The basis of gene expression profiling via micro-array technology relies on 
5 comparing an organism under a variety of conditions that result in alteration of the 
genes expressed. Within the context of the present invention a single population 
of cells was exposed to a variety of stresses that resulted in the alteration of gene 
expression. Alternatively, the cellular environment may be kept constant and the 
genotype may be altered. Typical stresses that result in an alteration in gene 

10 expression profile will include, but is not limited to conditions altering the growth 
of a cell or strain, exposure to mutagens , antibiotics, UV light, gamma-rays, 
x-rays, phage, macrophages, organic chemicals, inorganic chemicals, 
environmental pollutants, heavy metals, changes in temperature, changes in pH, 
conditions producing oxidative damage, DNA damage, anaerobiosis, depletion or 

15 addition of nutrients, addition of a growth inhibitor, and desiccation. Non-stressed 
cells are used for generation of "control" arrays and stressed cells are used to 
generate an "experimental", "stressed" or "induced" arrays. 

In an alternate embodiment the present invention provides a method for 
quantitating the amount of each protein specifying RNA contained within an 

20 organism. This is often necessary in gene expression profile analysis because the 
quantity of transcript produced as well' as its fold elevation is needed for 
quantitative analysis of the cell's physiological state: The method is applicable to 
both prokaryotic and eukary otic organisms including for example/cyanobacteria 
(such as Rhodobacter ancl Syneizhocystis) yeasts (such as Sacctiaromyces, 

25 ZygosaccharomyceSy KIuyveromyces, Candida^ Hansenula, Debaryomyces, 
Mucor, Pichia and Torulopsis), filameritous fungi (such as Aspergillus and 
Arthrobotrys), plant cells 'and animal xells. The method proceeds by generating a 
comprehensive micro-array described above, from either total or 
poly-adenylated RNA, depending on the whether the organism is prokaryotic or 

30 eukaryotic. Following the generation of the array, a set of labeled DNA and a set 
of labeled cDNA are synthesized having complementarity to the ORF's of the 
array. The signals generated from the independent hybridization of either the 
labeled DNA or cDNA are used to quantitate the amount of protein specifying 
RNA contained within a genome. 

35 In another embodiment the invention provides a method for gene 

expression profiling with a reduced signal to noise ratio. This is accomplished 
using a dual "label swapping" method and is again applicable to both prokaryotic 
and eukaryotic genomes. "Label swapping" refers to a system where a set of 
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probes or cDNA generated from control or experimental conditions are labeled 
with two different labels and mixed prior hybridization with the array. Two sets 
of control and experimental probes or cDNA's are generated. One of the control 
sets is labeled with a first label (i.e. cy3) and the other is labeled with a different 
5 second label (i.e. cy5). The two differently labeled sets are mixed and then 
hybridized with the array. The same process is repeated for the experimental 
conditions and the resulting control and experimental fluorescent signals are 
compared. This combination of signals provides (a) additional measure of each 
transcript level and (b) allows for the canceling of any bias associate with 
10 differential incorporation of fluorescently labeled nucleotide into cDNA or the 
hybridization of that cDNA. 

The preferred embodiments of the invention are discussed below. 

Bulk E. coli RNA was reverse transcribed to prepare hybridization probes. 
Despite the large amount of stable RNA (ribosomal and transfer RNAs) in the 
15 template, hybridization to protein-encoding genes was readily detected. 

As shown in Figure 1 with IPTGL induction, conditions have been 

■if', j- 

optimized to yield highly reliable data. In Figure 1, basal expression levels were 
plotted on the ordinate, induced levels on the abscissa. Panel A illustrates the 
results obtained when two Cy3-labeled probes were hybridized to duplicate whole 

20 genome array sets. Panel B ( represents an experiment in which the Cy5-labeled 
cDNA copy of control RNA and the Cy 3 -labeled copy of induced RNA were 
co-annealed to a single slide set. The RNAs used to generate the results in 
Panel B were each labeled with the other dye to allow a "reciprocal" 
hybridization. In Panel C, the resulting data were averaged with the data 

25 presented in Panel B to yield the scatter plot depicted in Panel C. A second 
independent set of RNA samples were isolated, their cDNAs labeled with both 
dyes and products hybridized in both possible combinations to generate the results 
depicted in Panel D. Panel E displays the averaged results of the two independent 
experiments depicted in Panels C and D. . 

30 Reciprocal Labeling . When the results of a single hybridization experiment using 
different slide sets as capture reagents for Cy3-labeled cDNA derived from treated 
and control cells were plotted in log-log form, lacZYA induction above the 
background was detected (Figure 1 A); variation of other genes was also 
significant as indicated by the width of the points falling along the diagonal of this 

35 scatter plot. Improvements were observed by labeling the control sample with 

Cy5 and the induced sample with Cy3 before hybridizing to a single set of 3 slides 
(Figure IB). However, there was a skewing of the data away from the abscissa 
and towards the ordinate (y 7 axis; Cy5-labeled probe). Averaging of these results 
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with others obtained using reciprocal copying of the same RNA samples (induced 
RNA reverse transcribed with Cy5 and control RNA with Cy3) resulted in a 
decreased variation between the treated and control samples (Figure 1C). Such 
"label swapping" lessened the skewing and decreased the scatter. The experiment, 
5 depicted in Figure 1C, was replicated; fresh cultures were induced and nucleic 
acids processed to yield the data depicted in Figure ID. The experiments shown 
in Figures 1C and ID each represent four measurements of individual transcript 
abundance; this repetition and averaging yielded the tight constellation shown in 
Figure IE which combined the data of Figure 1C and ID. Nonetheless, the scatter 

10 plot resulting from an experiment using the optimized protocol (Figure 1 E) 

illustrated that measurements of gene expression were still subject to considerable 
variation when the signal was in the lowest part of the detectable range. 

The effect of 1 mM IPTG upon expression of the arrayed genes was 
investigated. Duplicate RNA preparations of the control and IPTG treated cells 

15 were each labeled with Cy3 and Cy5by first strand cDNA synthesis. Averaging 
of measurements gave an optimal reliability of the data (Figure 1). Examination 
of the extent of hybridization to any individual gene revealed a wide dynamic 
range with more than a thousand fold variation in signal intensity between genes 
(see Figure 1 ). The expression of only 8 genes increased by a factor of more than 

20 2 after exposure to 1 mM IPTG for 1 5 min (Figure 1 E). These induced genes are 
listed in Table 1 . Two-fold or greater repression was not observed after this 
treatment. The most highly induced RNAs corresponded to the lac operon 
structural genes. Examples of the induced genes are b0956, melA, uxaA and 
bl783. ' < v I ■< • •• ..••'</ ;<•! •> ■ V : ' • 

25 Signal Quantitation . The present invention was applied to monitor the effects of 
growth stage and medium on gene expression. For these embodiments, signal 
quantitation was important. The percentage of RNA that programs protein 
synthesis has been determined under a wide variety of growth regimes (Bremer 
and Dennis, Escherichia coli and Salmonella: Cellular and Molecular Biology 

30 ASM Press: 922-937 (1996)). The fraction of those protein-specifying transcripts 
devoted to each arrayed gene was estimated. Hybridization signals arising from 
annealing of RNA-derived Cy3-labeled cDNA populations were quantitated by 
dividing by the signal generated using Cy3 fluorescent DNA arising from copying 
of sheared E. coli genomic DNA as a probe. The probe synthesized by copying 

35 genomic DNA was used to approximate equimolar transcription of the entire 
genome. This quantitation allowed calculation of mRNA inventories. Three 
RNA samples were measured. The samples were isolated from cells growing 
exponentially in rich medium, from cells growing exponentially in minimal 
■}''' ' • 17 
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medium, and from cells in minimal medium transitioning from exponential to 
stationary phase. RNAs from certain central metabolic (gapA, ptsH), defense 
(ahpC t cspQ, DNA metabolic (hns), surface structure (acpP t ompACFT, lpp\ 
translation (rplBCKLMPWX, rpmBCl rpsACDHJNS, trmD,fusA t infC, tufAB\ 
5 transcription (rpoAB), and unassigned (b4243) genes (Riley and Labedan, 

Escherichia coli and Salmonella: Cellular and Molecular Biology, ASM press: 
21 18-2202 (1996)) were abundant (>0.1%, among the top 100 transcripts) in all 
three samples. 

The most highly transcribed genes in actively growing broth-cultured cells 

10 often encoded proteins involved in translation. In contrast, cultures at a similar 
growth stage in glucose minimal medium, expressed to a very high level several 
small molecule biosynthetic genes and the means to utilize glucose. Thus, an 
agreement between these molecular analyses and the accumulated understanding 
of E. coli physiology was observed (Escherichia coli and Salmonella: Cellular 

15 and Molecular Biology, ASM press). This agreement was underscored in the 
analysis of cells transitioning from the exponential growth phase; the elevated 
expression of several rpaS-controlied genes corresponded to expectations 
(Escherichia coli and Salmonella: Cellular and Molecular Biology, ASM press). 
The genes, each representing between 0.0007% and 1% of the hybridizing 

20 signal, were expressed in LB grown cells. The distribution of genes as a function 
of expression level is plotted in Figure 2. Figure 3 depicts fractional expression as 
a function of summed genes with genes ranked by expression level. In Figure 2, 
the histogram plots the number of genes as a function of expression range. 
Diagonally striped, solid, and horizontally striped bars reflect distributions 

25 observed in RNAs derived from cells growing exponentially in minimal medium, 
cells transitioning to stationary phase in minimal medium, and cells growing 
exponentially in rich medium, respectively. In Figure 3, the fraction (summed 
open reading frame transcripts/total open reading frame transcripts) was plotted as 
a function of genes summed. The order in which genes were summed was based 

30 upon expression level with the most highly expresseid gene summed first. 

Fewer genes were expressed in LB than in minimal medium (Figure 2); the 
fraction of rare transcripts 'appeared iifi medium (Figure 3). 

The fifty most highly expressed genes in broth-grown' cells are listed in left-most 
columns of Table 2; twenty-six' of ttieseWenVely transcribed genes encode 

35 proteins involved in translation while' three encocie chaperones.' ' 

The broad distribution analyses (Figures 2 and 3) readily revealed the 
significant differences observed in expression of E. coli when grown in defined 
and rich media. In minimal media many more genes were transcribed over a 
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somewhat broader range. The 50 genes most highly expressed in exponentially 
growing cells cultured in minimal medium with glucose as a carbon/energy source 
are listed in the middle columns of Table 2. Eight biosynthetic genes were highly 
expressed (Table 2). Notable among them were metE, encoding the aerobic 
5 methionine synthase, and ilvC, an isoleucine-valine biosynthetic gene subject to 
feed-forward transcriptional activation (Umbarger, H.E, Escherichia coli and 
Salmonella: Cellular and Molecular Biology, ASM Press (1996) ) by its 
substrates. Both the //vC-encoded enzyme (Petersen et al„ Nucleic Acids Res, 
14:9631-9651 (1986)) and mef£-encoded enzyme (Green, R. C., Escherichia coli 

10 and Salmonella: Cellular and Molecular Biology, ASM Press (1 996)) are sluggish 
catalysts. The metE product accounts for about 5% of E. coli protein when cells 
are cultured in minimal medium with glucose as a carbon/energy source 
(VanBogelen et al., Escherichia coli and Salmonella: Cellular and Molecular 
Biology, ASM Press (1996)). Other highly expressed biosynthetic genes included 

15 folE and cysK; the folE product, GTP cyclohydrolase I catalyzes both cleavage of 
the 5-membered ring of guanine and the rearrangement of the ribose moiety of the 
substrate, GTP (Green et aL, Escherichia coli and Salmonella: Cellular and 
Molecular Biology, ASM Press (1996)). cysK, encoding o-acetylserine(thiol)- 
lyase isozyme A, is responsible for more than 90% of sulfur fixation under aerobic 

20 conditions (Kredich, N. M., Molecular Biology, ASM press (1 996)). Transcripts 
of the pyrBI operon encoding aspartate transcarbamylase also were highly 
expressed during exponential growth in minimal medium relative to a broth- 
grown culture. This expression level is a characteristic signature of strain 
MG1655 whose aspartate transcarbamylase content is elevated more than 100 fold 

25 when grown in the absence of uracil due to an rph mutation that is polar on pyrE 
(Jensen, K. F„ J. BacterioL 181:3525-3535 (1993)). The other highly expressed 
transcripts, thrL and pheF t encoded, respectively, the threonine leader polypeptide 
(Landick et aL, Escherichia coli and Salmonella: Cellular and Molecular Biology, 
ASM Press (1996)) and the phenylalanine-inhibited first enzyme of the common 

30 aromatic pathway. The pheF product, one of three isozymes, is estimated to 
account for more than 80% of , the activity catalyzing the first common step of 
aromatic amino acid synthesis (Pittard, A. J., Escherichia coli and Salmonella: 
Cellular and Molecular .Biology, ASM Press (1996)). 

In this embodiment, expression of several genes catalyzing fueling 

35 reactions was also elevated. Unexpectedly, aceAB, encoding the glyoxylate shunt 
enzymes malate synthase and isocitrate lyase (Cronan and Laporte, Escherichia 
coli and Salmonella: Cellular and Molecular Biology, ASM Press (1996)), was 
highly expressed. Perhaps the TCA cycle functions in its branched state during 
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this phase of growth requiring the glyoxylate shunt for anapleurotic replenishment 
(Neidhardt et al, Physiology of the Bacterial Cell: A Molecular Approach, 
Sinauer Associates, Inc. (1990)). As expected, transcripts encoding 
phosphotransferase sugar transport common components (Postma et al. , 
5 Escherichia coli and Salmonella: Cellular and Molecular Biology, ASM Press 
(1996)) also accumulated to a very high titer in glucose-minimal medium. 

The present invention was applied to monitor the transcripts of cells 
transitioning from exponential to stationary phase in defined, minimal medium. 
During this transition, significant changes in gene expression were expected and 

10 observed. Expressed gene levels were from 0.0023 to-h6%." A total of 

1030 genes, of which 1 10 have a defined role, did not appear to be expressed. In 
this embodiment, the 50 most highly expressed genes during this transition are 
listed in the rightmost columns of Table 2. Significantly, several rpoS-regulated 
genes (Hengge-Aronis, Escherichia coli and Salmonella: Cellular and Molecular 

15 Biology, ASM press, 1497-1512) including hdeA (11 fold), hdeB (8.9 fold), dps 
(4.4 fold), gadA (8.2 fold) and gadB (1 2 fold) (Castanie-Cornet et al., J. Bacteriol 
181:3525-3535 (1999)) as well as rpoS (2.6 fold) itself became quite highly 
expressed. Despite this remodeling of transcription, the overall patterns of gene 
number as a function of expression level figure 2) and fracticmal expression as a 

20 function of ranked gene (Figure 3) were hot as distinct as might have been 
expected in comparison to the patterns observed for RNA extracted from 
exponentially growing cells. 

The observed expression patterns are summarized in Table 3 where gene 
products were grouped by metabolic function using an established classification 

25 scheme (Riley and Labedan, Escherichia coli and Salmonella: Cellular and 

Molecular Biology, ASM Press (1996)). Exponential growth in minimal medium 

elevated the amount of pyrimidineand amino acid biosynthetic transcripts. In 

*** ^ - * i.' 

contrast cofactor and purine transcripts did not appear ( to accumulate relative to 

growth in broth. Expression of glyoxylate shunt and miscellaneous glucose 
30 transcripts was also elevated in minimal medium; the seven-fold elevation of 

glyoxylate shunt transcripts exceeded the average of that observed for amino acid 

biosynthetic mKNAs. Expression of genes involved in sulfur fixation was also 

elevated during growth in minimal medium. 

The rapid growth observed in LB was reflected in the gene expression 
35 profile, as was the difference in carbon energy/source between glucose and amino 

acids. LB-grown cultures displayed jelevated expression of genes specifying 

glucogenic enzymes and of genes whose products degrade small molecules. 

Expression of the ATP and proton motive force generating machinery, elevated by 
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a factor of about 2, paralleled increased ribosomal protein, aminoacayl-tRNA 
synthetase and foldase/usher expression. 

Changes observed upon entering the transitional period between 
exponential and stationary phase growth were less dramatic. Nonetheless, 
5 elevation of mRNAs specifying gluconogenic, glycolytic, and TCA cycle 
enzymes was observed as was an increase in transcripts-encoding enzymes 
responsible for metabolic pool interconversions and for the non-oxidative branch 
of the hexose monophosphate shunt. The cells also displayed an increased titer of 
foldase/usher-specifying and global regulatory function transcripts while 
10 transitioning between growth phases. ' 

The present invention was used to monitor the change in gene expression 
when cells overexpressd sdiA gene. The sdiA is a positive activator of an operon 
that includes ftsQAZ, genes essential for septation. f < > 

RNA isolated from broth grown, exponential phase cultures harboring 
15 either a single copy (pUC19/RFM443) or many copies (pDEW140/RFM443) of 
sdiA were compared after conversion into fluorescently labeled cDNA by 
hybridization to individual genes arrayed on glass slides. 

Expression of about 9% of the E. coli genes was elevated in the strain 
containing the multicopy sdiA plasmid (Table 4). Transcripts of seven genes 
20 involved in cell division were raised 2.1 to 1 1 fold by amplification of sdiA as 

were a large number (about 20) of genes involved in DNA replication, repair, and 
degradation. Transcript levels of eight genes whose products alter the 
susceptibility of E. coli to drugs were more highly expressed in the strain 
containing the gene amplification. This genetic configuration also resulted in 
25 elevated expression of several lipopolysaccharide biosynthetic genes (rfa) as well 
as open reading frames encoding membrane structural elements. 

Expression of several genes of unknown function was also elevated in 
response to the presence of multiple copies of sdiA. The genes whose transcripts 
were highly (>6 fold) elevated in response to the multicopy sdiA plasmid 
30 included: b0135 (6.4 fold, annotated as putative fimbrial-lilce protein gene), 
b0225 (6.4 fold, a 1 gene apparently co-transcribed s ydt\idinJ since between them 
there is only a 3 base pair intergenicregion), b0157 f (ll fold; encoding a putative 
malate dehydrogenase), b0530 (also'khown as sftnA md predicted to specify a 
fimbrial like protein was elevated 6.5 fold), b0712 (encoding a putative 
35 carboxylase had a 6.4 fold increaW'iii'tfanscript content) and bl438 (1 1 fold 
elevation in expression). ^MiUtiiwi' J< ! 

Around 3% of the E- coli genes*were repressed in a strain harboring the 
sdiA plasmid relative to. trie* control strain 'containing the vector (Table 5). The 
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genes involved in chemotaxis, mobility, and flagella biosynthesis were repressed 
dramatically. Genes for transport of certain carbohydrate substrates and cations 
(Fe** and K + ), degradation of corresponding carbon compounds, as well as 
acetate fermentation were repressed. The presence of pDEW140, a pUC 19 
5 derivative harboring sdiA, resulted in a 30-fold elevation in detectable sdiA 

transcript. Expression of sdiA was very low (0.0015 %, the 4212th most abundant 
transcript) in LB grown E. coli MG1655. The increased expression in the plasmid 
containing strain raised the transcript rank to about 300. 

Genes ddl, ftsQ, ftsA, ftsZ and IpxC are organized in the order mentioned 

10 above in the complexyfaZ containing operon, and the above genes are transcribed 
in the same direction starting with ddl Since the -encoded positive activator 
drives transcription of a mRNA including ddl, ftsQ, ftsA, ftsZ, and IpxC, increased 
quantities of RNA hybridizing to these genes were expected. Amplification of 
sdiA due to its presence on a multicopy plasmid elevated expression of ddl, ftsQ, 

15 ftsA, ftsZ and IpxC 4.6, 8.8, 10, 1 1 and 3.5 fold, respectively, relative to the strain 
that harbored pUC 19 (Table .4). ( l iti \ , t, 

Tn the immediate, down stream of sdiA, there aieyecF, followed by uvrY 
and uvrC gene, respectively, uvr Y and uvrC genes are transribed in the same 
direction as sdiA and the yecF is transcribed in the opposite direction. 

20 Unexpectedly, amplification of sdiA elevated expression of two genes downstream 
of sdiA was observed. uvrY expression was elevated 12 fold while uvrC 
transcription was increased by a factor of 9 (Table 4). These two genes were 
transcribed in the same direction as sdiA. The expression of yecF decreased only 
slightly. 

25 Amplification of sdiA caused the expression of 101 genes to fall by a 

factor of 2 or more. Among them, 44 r were involved in motility and chemotaxis. 
Thirty four genes were down regulated more than five-fold by sdiA amplification. 
Of these, thirty were involved in chemotaxis or motility (cheW; 
MB,QD,E,F,GMJJXLMMfliA^^^ tarandtsr). The 

30 master regulator genes flhC and D controlling flagella operon expression were 
lowered by only 30-38 %. 

The swarming of strains having single or multiple copies of sdiA was 
examined by spotting four single colony isolates of each strain on semi-solid 
medium. Since almost all the genes involved in flagella biosynthesis, chemotaxis 

35 and motility were dramatically repressed in the sdiA overexpression strain, loss of 
mobility of the sdiA overexpression strain was predicted. Experiments were 
carried out to compare the mobility of the two strains. After 8 hr. at 37°C, the 
strain containing pUC 19 had swarmed (diameter =32+ 2.5 mm) while that 
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containing pDEW140 (sdiA+) had not (diameter =3.2+0.4 mm). After 23 h the 
pUC19 containing strain had filled the petri plate while the strain carrying the 
sdiA amplification had significantly swarmed covering about one half of each 
plate. This partial phenotype could be explained by either (a) plasmid loss 
5 allowing swarming of a revertant (sdiA+ haploid) population as ampicillin was 
exhausted from the medium or (b) sdiA amplification only partially compromising 
motility. To distinguish between these possibilities, the site of inoculation and the 
edge of the swarm after 23 hr were streaked for single colonies to an ampicillin 
containing LB agar plate. Massive sdiA+ plasmid loss from cells at the edge of 

10 the swarm was not observed suggesting that the motility defective phenotype was 
not an absolute one. H 

If the role of sdiA is to stimulate gene expression required for septation, 
sdiA might coordinate expression of the yfrZ-containing operon with action at the 
origin of replication, oriC. The two genes immediately flanking oriC are mioC 

15 and gidA, mioC is followed by asnC and asnA, and gidA is followed by gidB, atpl 
and atpB. All of the genes except asnA are transcribed in the same direction. 
gidA and mioC were over-transcribed relative to the vector-containing control 
strain. mioC transcript content was elevated 7 fold while those of the gidA and 
gidB genes were elevated 4 and 2 fold, respectively. This effect was most 

20 localized; adjoining genes were not over-expressed. ] ' ■') 

Having found enhanced action around oriC, it was reasonable to examine 
the transcript content of genes surrounding the termini of replication when sdiA 
was amplified. There are multiple termini in £ colDThe region surrounding terB 
spans minutes 35.3-37.3 (Berlyn et al., Escherichia coli and Salmonella: Cellular 

25 and Molecular Biology ASM Press: 922-937 (i 996)) sdiA amplification-elevated 
expression of 12 of the 88 genes in this region more than 3 fold. Transcripts from 
another 26 genes in 'the region were elevated t>y a factor of 1.5 to 3. Unlike the 
action observed around the terminus, the stimulation seen in the vicinity of terB 
was diffuse. Interestingly, tau, encoding the terminus-utilizing factor, was not 

30 over-expressed. Transcription of gusR, located at 36.5 minutes, was elevated 
8 fold by sdiA amplification (Table 4). 

acr genes specify sensitivity to acriflavines, molecules that intercalate into 
double stranded DNA containing monotonic runs of base pairs. Most acr mutants 
display a defect in acridine efflux; moreover they are often pleiotropic being 

35 hypersensitive to a wide variety of chemicals. Thus hyper-expression of these 
genes in a strain harboring an sdiA-bzarmg multicopy plasmid could lead to 
mitomycin C expulsion and the observed resistance to this DNA damaging agent. 
This expectation of acr hyper-expression was confirmed. Evidence for elevated 
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expression of each acr operon was found as indicated by the fold expression 
reported in Table 4. 

Elevated transcription of the gal operon genes at minute 17 was observed 
in the strain bearing the sdiA amplification. These genes, moderately expressed 
5 when strain MG1 655 was grown in LB medium (ranks: galE 84 1 , galT 1512, 
galK 599; Wei and LaRossa, unpublished), were elevated 3.8, 4.9 and 4.1 fold, 
respectively. Nearby, at minute 16 is the ybglJKL-nei region, ybg genes are 
organized as ybg F,ybgJ, ybgK and ybgL, in that order followed .by mi gene. 
These genes, transcribed in the same orientation, could constitute an operon since 

10 the open reading frames are densely packed, at times overlapping. sdiA 

amplification elevated expression of these genes 5.2, 4.7, 6.4, 3.8 and 8.6 fold, 
respectively, mi encodes an endonuclease responsible for the excision of 
oxidized pyrimidines in the double helix. 

Two linked genes at minute 44, bl956 and bl957 were elevated 6.6 and 

15 14 fold by sdiA amplification. Similarly, expression of b201 7 and b201 6, two 
genes at minute 45 divergently transcribed from and adjacent to the his operon, 
was elevated 3.8 and 3.5 fold, respectively by the presence of the irfL4-containing 
multicopy plasmidJ ' ' ' ^M*' 

Mitomycin C (MMt) is a D^AjdamVgm coli, strain,. MG 165 5, 

20 was exposed to MMC, and gene expressions were comparecl in cells that were 
harvested altl 5 and 40 min post exposure. ' In' the cells that were harvested at 
1 5 min, very little SOS response was detected. At the 40 min, expression 
of40 genes was elevated greater than 2 fold relative to the control strain. Among 
the 40, 13 stress response genes were induced (Table 6) more than 2 fold. The 

25 SOS genes that were induced by a 40 min exposure to MMC were recN, dinl, 
sulA, lexA, recA, uvrA, dinD, priQ umuC, mioC, uvrB, ruvA, andxseA. 

The SOS responsive genes are /ex^ -dependent. In order to determine the 
gene expression patterns in the presence and the absence of the SOS response, 
DM800 and DM803 were exposed to MMC for 40 min and the gene expression 

30 profiles were compared. DM800 and DM803 harbor lexA* and lexA ind alleles, 
respectively. As expected,. when exposed to MMC for 40 min, SOS responsive 
genes were induced greater than 2 fold in DM800 strain. SOS responsive genes, 
including lexA, were not induced in the DM803 strain (Tables 7 and 8). Many 
genes that were not induced by MMC in DM800 were induced by the DNA 

35 damaging agent in DM803. For examples, the expression of the following genes 
were induced greater than 2 fold in DM803 but not in DM800 (Tables 7 and 8): 

among the induced genes are those involved with cell division (i.e., dicB, dicC, 

• « ■• ■ . J t • •• ". v . \ ■ ' 1 .jr. ih ' ' 
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and sdiA); chemotaxis and mobility (i.e., cheWand motA)\ and the transport of 
small molecules (i.e., cycA t fadL, chaC, codB and btuC). 

The present invention is not limited to only highly expressed genes for 
several reasons. First, reproducible expression measurements were obtained over 
5 a wide dynamic range (Figure IE). Second, the data of Figure 3 and Table 1 

illustrate that the lac operon expression, although low before IPTG induction, was 
detected suggesting that most transcripts can be readily measured with the 
described techniques. Analyses of well-characterized "promoter-down" mutants 
or spiking experiments may be useful in defining the lower limits of expression 
10 that can be observed. 

EXAMPLES 

The present invention is further defined in the following Examples. It 
should be understood that these Examples, while indicating preferred 
embodiments of the invention, are given by way of illustration only. From the 

15 above discussion and these Examples, one skilled in the art can ascertain the 
essential characteristics of this invention, and without departing from the spirit 
and scope thereof, can make various changes and modifications of the invention to 
adapt it to various usages and conditions. 
GENERAL METHODS 

20 Standard recombinant DNA and molecular cloning techniques used in the 

Examples are well known in the art and are described by Sambrook, J., Fritsch, 

E. F. and Maniatis, T. Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Laboratory Press: Cold Spring Harbor, (1989) (Maniatis) and by T. J. 
Silhavy, M. L. Bennan< and L. ^V. Enquist, Experiments with Gene Fusions, Cold 

25 Spring Harbor Laboratory, Cold Spring Harbor, N.Y. (1984) and by Ausubel, 

F. M. et al., Current Protocols in Molecular Biology, pub. by Greene Publishing 
Assoc. and Wiley-Interscience (1987). 

The meaning of abbreviations is as follows: "hr" means hour(s), "min" 
means minute(s), "sec" means second(s), "d" means day(s), "mL" means 

30 milliliter(s), "pL" means microliters), "nL" means nanoliter(s), "ng" means 

microgram(s), "ng" means nanogram(s), "mM" means millimole(s), "jaM" means 

micrpmole(s). , . : , i ; 

Media and Culture Conditions: ? » ! 1 H . 1.,/ 

Materials and methods suitable for the maintenance and growth of 

35 bacterial cultures were found-in Experiments in Molecular Genetics (Jeffrey H. 
Miller), Cold spring Harbor Laboratory Press (1972), Manual of Methods for 
General Bacteriology (Phillip Gerhardt, R.G.E. Murray, Ralph N. Costilow, 
Eugene W. Nester, Willis A. Wood, Noel R. Krieg and G. Briggs Phillips, eds), 
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pp. 210-213, American Society for Microbiology, Washington, DC. or Thomas D. 

Brock in Biotechnology: A Textbook of Industrial Microbiology . Second Edition 

(1989) Sinauer Associates, Inc., Sunderland MA. All reagents and materials used 

for the growth and maintenance of bacterial cells were obtained from Aldrich 
5 Chemicals (Milwaukee, WI), DIFCO Laboraoties (Detroit, MI), Gibco/BRL 

(Gaithersburg, MD), or Sigma Chemical Company (St. Louis, MO) unless 

otherwise specified. 

LB medium contains following per liter of medium: Bacto-tryptone (10 g), 

Bacto-yeast extract (5 g), and NaCl (1 0 g). 
10 Minimal M9 medium contains following per liter of medium: Na 2 HP04 

(6 g), KH 2 P0 4 (3 g), NaCl (0.5 g), and NH 4 C1 (1 g). 

Above media were autoclaved for sterilization then 10 mL of 0.01 M 

CaCl 2 and 1 mL of MgS0 4 . 7H 2 0 plus carbon source and other nutrient were 

added as mentioned in the examples. All additions were pre-sterilized before they 
15 were added to the media. 

Molecular Biology Techniques : 

Restriction enzyme digestions, ligations, transformations, and methods for 

agarose gel electrophoresis were performed as described in Sambrook, J., et al., 

Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor 
20 Laboratory Press (1 989). Polymerase Chain Reactions (PCR) techniques were 

found in White, B., PCR Protocols: Current Methods and Applications . Volume 

15(1993) Humana Press Inc. 

EXAMPLE 1 

Example 1 demonstrates genomic DNA amplification and the preparation 

25 of the high density DNA array. 

Amplification of 4290 E, coli genes Specific primer pairs (available from Sigma 
Genosys Biotechnolgies, The Woodlands, TX) for each protein-specifying gene of 
E. coli were used in two consecutive PCR amplification reactions. Genomic DNA 
(30 ng) was used as the template in the first round of PCR amplification, and 

30 500-fold diluted PCR products served as templates for PCR re-amplification. 
Duplicate 50 |iL scale reactions were performed. The PCR reactions were 
catalyzed with ExTaq™ polymerase (Panvera, Madison, WI) with the four 
dNTPs (Pharmacia), present at 0.25 mM and the primers at 0.5 ^M. Twenty- 
five cycles of denaturation at 95°C for 30 sec, annealing at 64°C for 30 sec and 

35 polymerization at 72°C for 2 min were conducted. A 2 (iL aliquot of each PCR 
product was sized by electrophoresis through agarose gels. More than 95% of the 
second round PCR products displayed' visible bands of the correct size. Second 
round PCR reactions devoid of templates and primers were saved to serve as 
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negative controls for hybridization capture reagents. One third of each second 
round PCR reaction was purified using 96-well PCR purification kits (Qiagen, 
Valencia, CA). The eluted DNAs were dried using a vacuum centrifuge. 
Arraying amplified genes . Twenty microliters of 6M Na 2 SCN or 50% DMSO 
5 was added to each dried DNA sample (> 0.1 ng/nL). A generation II DNA spotter 
(Molecular Dynamics, Sunnyvale, CA) was used to array the samples onto coated 
glass slides (Amersham Pharmacia Biotech, Arlington Heights, IL). Aliquots of 
approximately 1 nL from 1536 resuspended PCR products were arrayed in 
duplicate on each.slide; a setof three slides supported all amplified E. coli genes. 
10 To serve as controls, 76 specific E. coli PCR products, 8 amplified genes of 
Klebsiella pnuemoniae and 12 plant cDNA clones were also spotted onto each 
slide. Spotted glass slides, after baking at 80°C for 2 hr. s were stored under 
vacuum in a desiccator at room temperature. 

EXAMPLE 2 

15 Example 2 demonstrates gene expression analysis. E. coli mRNA was 

isolated, fluorescent labeled cDNA was prepared using mRNA as a template, and 
the labeled cDNA was hybridized to the high density DNA anay. The amount of 
DNA hybridized to DNA array was quantitated and analyzed. 
Microbiological Methods . 

20 E. coli MG1655 was cultured with aeration in either the minimal medium, 

M9 (Miller, J. H., Experiments in Molecular Genetics, Cold Spring Harbor 
(1972)), supplemented with 0.4% glucose or in the rich medium, LB (Miller, J. H., 
Experiments in Molecular Genetics, Coid'Spring Harbor (1972)), at 37°C. The 
overnight culture was diluted 250 t fold into fresh medium and aerated by shaking 

25 at 37°C. Samples of the minimal, medium culture were harvested at A^-QAQ 
(exponential phase) and 1.6 (transition to stationary phase) prior to RNA isolation. 
An IPTG induction (Miller, J. H., Experiments in Molecular Genetics, Cold 
Spring Harbor (1972)) was performed to examine the specificity with which it 
effects gene expression. A culture grown overnight in LB at 37°C was diluted 

30 250 fold into fresh LB and aerated at 37°C. When the culture achieved an 

appropriate density (A 6 qo=0.40), it was split. To one portion was added IPTG to a 
final concentration of 1 mM; the untreated sample served as a control. Incubation 
of both samples was continued with aeration at 37°C for another 15 min 
( A 600 = 0.45 for both cultures) before RNA isolation was initiated. 

35 RNA Isolation . An equivalent volume of shaved ice was added to 50 mL samples 
which were pelleted immediately in a refrigerated centrifuge by spinning at 
10,4 1 0 x g for 2 min. Each resultant pellet was resuspended in a mixture 
containing 100 |iL of Tris HC1 (10 mM, pH 8.0) and 350 \xL of P-mercaptoethanol 
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supplemented RLT buffer [Qiagen RNeasy Mini Kit, Valencia, CA]. The cell 
suspension was added to a chilled 2 mL screwed-capped microfiige tube 
containing 100 \xL of 0.1 mm zirconia/silica beads (Biospec Product Inc., 
Bartlesville, OK). The cells were broken by agitation at room temperature for 
5 25 sec with a Mini-Beadbeater (TM) (Biospec Products Inc., Bartlesville, OK). 
Debris was pelleted by centrifiigation for 3 min at 16,000 x g and 4°C; the 
resultant supernatant was mixed with 250 jiL of ethanol. This mixture was loaded 
onto a column from the Qiagen RNeasy Mini Kit. RNA isolation was completed 
using the protocol supplied with this kit. Incubation for 1 hr. at 37°C in 40 mM 

10 Tris pH 8.0, 10 mM NaCl, 6 mM MgCl 2 with RNase free RQ1 DNase (1 unit/jiL, 
Promega, Madison, WI) digested any genomic DNA contaminating the RNA 
preparation. The digestion products were purified by a second passage through 
the RNeasy protocol (Qiagen, Valencia, CA). The product was eluted from the 
column in 50 |iL RNAse-free water prior to determining sample concentration by 

15 an A 2 6o reading. RNA preparations were stored frozen at either -20 or -80°C. 

Synthesis of fluorescent cDNA from total RNA . Six microgram of RNA template 
and 12 [ig of random hexamer primers (Operon Technologies, Inc., Alameda, CA) 
were diluted with double distilled (dd^ water to a volume of 22 \xL, Annealing 
was accomplished by incubation at/70°C for 10 min followed by 10 min at room 

20 temperature. In order were added: 8 of 5,x Superscript II reaction buffer (Life 
Technologies, Inc., Gaithersberg, MD), 4 jjL of 0.1M DTT, 2 ^iL of the dNTP mix 
(2 mM dATP, 2 mM dGTP, 2 mM TTP, 1 mM dCTP), 2 of 0.5 mM Cy3- or 
Cy5-dCTP (Amersham Pharmacia Biotech, Arlington Heights, IL), and 2 (iL of 
Superscript II reverse transcriptase (200 units/mL, Life Technologies Inc., 

25 Gaithersberg, MD). DNA synthesis proceeded at 42°C for 2.5 hr. before the 

reaction was terminated by heating at 94°C for 5 min. Alkaline hydrolysis of the 
RNA templates was achieved by adding 2 jxL of 5M NaOH followed by 
incubation at 37°C for 10 min. Hydrolysis was terminated by the sequential 
addition of 3 uL of 5M HC1 and 5 \iL of 1M Tris-HCl, pH 6.8. The labeled 

30 cDNA was purified with a PCR purification kit (Qiagen, Valencia, CA), dried in a 
speed vacuum and stored at -20°C. Labeling efficiency was monitored using 
either A 5503 for Cy3 incorporation, or A 650 , for Cy5 labeling, to A 2 6o/atios. 
Fluorescent labeling of genomic DNA. Genomic DNA, isolated from strain 
MG1655 (Bachmann, B., Escherichia coli and Samonella: Cellular and 

35 Molecular Biology, ASM Press ( 1 996)) by standard procedures (Van Dyk and 

Rosson, Methods in Molecular Biology: Bioluminescence Methods and Protocols, 
Humana Press Inc. (1998)), was nebulized to approximately 2 kb pair fragments. 
Three microgram of this DNA was mixed with 6 \ig of random hexamers primers 
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(Operon Technologies, Inc., Alameda, CA) in 33 pL of dd water. DNA was 
denatured by heating at 94°C prior to annealing on ice for 10 min. Fluorescent 
copying of the genomic DNA was accomplished using the Klenow fragment of 
DNA polymerase I (5 pg/ pL, Promega, Madison, WI). To the DNA mixture was 
5 added 6 pL of 1 Ox Klenow buffer (supplied with the enzyme), 3 pL of the dNTP 
mix described above, 12 pL dd H 2 0, 3 pL of 0.5 mM Cy3-dCTP (Amersham 
Pharmacia Biotech, Arlington Heights, IL), and 3 pL of the Klenow fragment of 
DNA polymerase I. After a static, 2.5 h incubation at room temperature, the 
labeled DNA probe was purified using a PCR purification kit (Qiagen, Valencia, 

10 CA) before drying in a speed vacuum. 

Hybridization and washing . Spotted slides were placed in isopropanol for 10 min, 
boiled in dd H2O for 5 min and dried by passage of ultra-clean N 2 gas prior to pre- 
hybridization. The prehybridization solution (PHS) was 3.5xSSC (BRL, Life 
Technologies Inc., Gaithersberg, MD), 0.2% SDS (BRL, Life Technologies Inc., 

15 Gaithersberg, MD), 1% bovine serum albumin (BSA,Fraction V, Sigma, St. 

Louis, MO). The hybridization solution (HS) contained 4 pL of dd water, 7.5 pL 
of 20xSSC, 2.5 pL of 1% SDS (BRL, Life Technologies Inc., Gaithersberg, MD), 
1 pL of 10 mg/ml Salmon sperm DNA (Sigma, St. Louis, MO) and 15 pL of 
formamide (Sigma, St. Louis, MO). The slides were incubated at 60°C for 20 min 

20 in PHS. The slides were next rinsed 5 times in dd water at room temperature and 
twice in isopropanol before drying by the passage of nitrogen. The dried probe 
was resuspended in the HS and denatured by heating at 94°C for 5 min. 
Thirty microliter of the probe-containing HS was applied to a dried, 
pre-hybridized slide, covered with a cover slip (Corning, Corning, NY), and put 

25 into a sealed hybridization chamber containing a small reservoir of water to 
maintain moisture. Hybridization occurred for approximately 14 h at 35°C. 
Cover slips were removed in washing buffer I,(WR I:= 2xSSC, 0.1% SDS) 
warmed to 35°C prior to incubation ; fpr 5 ; min; Next, the slides were washed 
sequentially for 5 min in lxSSC, 0.1% SDS and O.TxSSC, 0.1% SDS. Slides were 

30 then passed through three baths, each passage lasting 2 min, in 0. lxSSC. The 
slides were dried with a nitrogen gas flow. 

Data Collection and Analysis . Hybridization to each slide was quantified with a 
confocal laser microscope (Molecular Dynamics, Sunnyvale, CA) whose 
photomultiplier tube was set to 700 volts and 800 volts for obtaining Cy3 and Cy5 
35 signals respectively. The images were analyzed with Array Vision 4.0 software 
(Imaging Research, Inc., Ontario, Canada). t The fluorescent intensity associated 
with each spotted gene was reduced by subtracting the fluorescence of an 
adjoining, non-spotted region of the slide: These readings were exported to a 
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spreadsheet for further manipulation. The four "no DNA" spots derived from 
PCR reactions devoid of template were controls used to determine the noise 
(background signal) level. 

The 96 genes present on each slide were used as internal controls to 
5 quantify signal intensities yielding equivalent readings among the three slides of a 
whole genome array set. This corrected for slide-to-slide signal variation. 

For the IPTG induction experiment, it was presumed that the overall 
transcriptional pattern did not change significantly. Thus the summed equivalent 
reading for the entire genome was quantified; analogous quantitation of the 

10 underlying equivalent readings allowed calculation of fold induction of each 
gene's expression by comparison of such quantified equivalent readings. 
RNA abundance . To convert normalized equivalent readings into measures of 
transcript abundance, a further correction was needed.t That correction required 
the hybridization signal arising from an equimolar concentration of all transcripts. 

15 The surrogate for this transcript pool was the fluorescent copy of genomic DNA. 
Thus, the fluorescent intensities from hybridization with RNA-derived probes 
were corrected using fluorescent intensities arising from genomic DNA derived 
probes. Specifically, the abundance of each gene's transcription product(s) was 
determined by dividing the normalized equivalent reading of the genomic DNA 

20 derived sample into the normalized equivalent reading from the RNA derived 
sample. The convention of Riley (Riley and Labedan Escherichia coli and 
Salmonella: Cellular and Molecular Biology ASM Press, 1996)) was followed in 
grouping genes into functional sets. 

EXAMPLE 3 

25 Example 3 demonstrates gene expression profile changes when cell were 

exposed to IPTG, or grown in different culture media. The results are illustrated 
in Tables 1,2 and 3 (Listing of Tables) as described above. 
IPTG Induction An E. coli strain MG1655 was grown overnight in LB at 37°C. 
The culture was diluted 250 fold into fresh LB and aerated at 37°C. When the 
30 culture achieved an appropriate density (A600 =0 - 40 )> ^ was sp 1 ^ in to two 
portions. ^ ; . i \ , , 

To one portion, IPTG was added to a final concentration of 1 mM. The 
other portion was untreated and served as a control, j i \ , , . ^ 

Both samples was incubated with aeration at 37°C for another 15 min 
35 (Agoo = 0- 45 f° r both cultures) before ,RNA isolation; ^Gene, expression analysis 
was performed as described in Exampies^l and 2. \, f! 

Cells were grown in different culture media E. coli MG1655 was cultured 
with aeration overnight in either the minimal medium, M9, supplemented with 
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0.4% glucose or in the rich medium, LB at 37°C. The overnight culture was 
diluted 250 fold into fresh medium and aerated by shaking at 37°C. Samples of 
the minimal medium culture were harvested at A5qq=0.40 (exponential phase) and 
A^oo^l -6 (transition to stationary phase) prior to RNA isolation. The LB culture 
5 was harvested at A^qq=0A prior to RNA isolation. Gene expression analysis was 
performed as described in Examples 1 and 2. 

EXAMPLE 4 

Example 4 demonstrates gene expression changes and the effect on 
mobility when sdiA gene was overexpressed in E. coli. The results are tabulated 
10 in Tables 4 and 5 (Listing of Tables) as described above. 

The following plasmids and strains were used in this example. 

strain or plasmid genotype 

MG1655 rph-1 

RFM443 rpsL galK2 \acA74 

pUC 1 9 Cloning vector 

pDEWHO , ( , pUC19 + ^(EcoRI) t t ; 

Strains and growth conditions . 

15 Strains of MG1655 (Bachmann, B., Escherichia coli and Samonella: 

Cellular and Molecular Biology, ASM Press (1996)) and RFM443 (Menzel R., 
Anal Biochem., 181:40-50 (1989)) have been described. 

pDEW140 was constructed as following: Chromosomal DNA isolated 
from E. coli W31 10 was partially digested with restriction enzyme &U/3A1 and 

20 size fractionated on agarose gels. Fractions of two size ranges (average sizes of 
approximately 2.5 and 4.0 Kbp) were ligated to pBR322 (0.1 1 pmol) or pUCl 8 
(0.1 1 pmol) that had previously been digested with restriction enzyme BamHl and 
treated with calf intestinal alkaline phosphatase. The molar ratio, of chromosomal 
DNA to vector in each of the ligation reactions was approximately 0.2: L The 

25 ligation products were used to transform ultracompetent E. coli XL2Blue 
(Stratagene) to ampicillin resistance. Pooled transformants (>10^ for each 
transformation) were used to isolate plasmid DNA. 

0.3 ng of the pUC18 library was electro-transformed into RFM443. The 
MMC resistant clones were selected on LB agar plates supplemented with 

30 100 [ig/mL of ampicillin and 6 mg/mL of MMC. Resistant colonies appeared 
after the incubation at 37°C. The colonies underwent single colony purification 
on the same medium. Plasmids derived from single colonies were isolated with 
the Qiagen 96-weIl turbo plasmid prep kit. These plasmids served as a template 
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for primer-directed DNA sequencing of the insert ends. One of the plasmids, 
Plasmid p[3+4/B10], was shown by sequencing to carry the sdiA and surrounding 
genes. From this plasmid sdiAv/as amplified by PCR using the primers: 
f primer = TGGCA CGCAG GACAG AA (SEQ ID NO: 1 ) 

5 d primer = TAAC A AATC A GCATA ACTCA T (SEQIDNO:2) 

The PCR used Ampli-Taq Gold. Conditions were 94°C, 1 1 min followed 
by 32 cycles of 94°C for 45 sec, 45°C for 45 sec, 72°C for 90 sec, the 72°C for 
7 min. 

The PCR product was blunt end ligate_d into JscoRV digested pT7Blue-3 

10 (Novagen). A clone having the proper sized fragment was obtained after 

transformation into DH5-alpha. From colonies, inserts of the proper size were 
detected by PCR-based analysis. Such colonies served as a source of plasmid 
DNA from which sdiA was liberated by digestion with EcoRI. The fragment was 
sized by electrophoresis through agarose gels and ligated into EcoRl digested 

15 pUC19. The ligation mixture was used to tranform DHSalpha. Plasmid preps of 
the transformants were sequenced. One such plasmid containing sdiA was named 
pDEW140 and transformed into strain RFM443. 

Plasmids pUC19 and pDEWHO were transformed into RFM 443 selecting 
for ampicillin resistance on solidified LB agar medium. 

20 Strains of RFM443 (pUC19) and RFM443 (pDEWHO) were grown 

overnight with aeration in LB with 150 \xg/mL ampicillin (LB with amp). The 
overnight culture was diluted 250 fold into fresh medium (LB with amp) and 
incubated further at 37°C with shaking. Cells were collected at O.D.600=0.45, 
and total RNA was purified using Qiagen RNeasy mini. 

25 Motility experiment . 

1 A single colony was picked from freshly grown RFM443 (pUC19) or 
RFM443 (pDEWHO) cultured on LB agar (1 .2%), and the center of a LB with 
amp soft agar (0.3%) plate was stabbed. The soft agar plate containing each 
culture was incubated at 37°C. The diameters of the growth zones of the two 

30 strains were measured and compared. 

EXAMPLE 5 

Example 5 demonstrates the differences in gene expression profile 

between strains proficient or deficient in their ability to respond to DNA 

damaging agents. An isogenic pair of strains, differing only in lexA, was used to 

35 investigate the cell's range of responses to the DNA damaging agent mitomycin C 

(MMC). The results are tabulated iii Tables 6,7, and 8 (Listing of Tables) as 
, . .... ■» e -. "ivit • 
described above. . ^. , , ^ 
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Strains E. coli strain, MG1655, was used to determine the gene expression profile 
of E. coli in response to a MMC challenge. Two isogenic E. coli strains (Mount 
et al J, Bacteriol 1 12:886-893 (1972)), DM800 (lexA + ), used as control 
displaying a normal response to DNA damage, and DM803 (/ex/l ind ), a strain 
5 unable to mount the predominant "SOS" response to DNA damage, were 
compared using comprehensive gene expression profiling. 
MMC experiment MG1655 cells were grown in LB overnight with aeration. The 
overnight cultures were diluted 100 fold in LB to final volume of 500 mL and 
grown at 37°C to exponential phase. 200 mL of culture was treated with MMC to 

10 the final concentration of 250 ng/mL. Another 200 mL of culture were mock 

treated without MMC for comparison. Cells were harvested at 15 min and 40 min 
for MG1655 strain. With DM800 and DM803 stains, cells, cultured in an 
identical manner, were harvested after 40 min exposure. RNA was isolated and 
gene expression profile was analyzed as shown in Examples 1 and 2. As seen in 

15 Tables 7 and 8, the lexA allele has a great influence on the response of cells to 
MMC. Table 8 shows that the strain deficient in SOS response still response to 
MMC but in different manner. - ;/*.*" 

i ^ EXAMPLE 6 
Preparation of & Svndctiocvstis sp. ; ?CC6803 cDNA Probes 

20 This example describes the construction of Synechocystis sp. PCC6803 

cDNA probes following growth of the cells in either minimal growth media 
(control) or minimal media plus UV-Bl light treatment. The prepared cDNA 
probes are used to determine gene expression patterns of many genes 
simultaneously on a Synechbcystissp\ PCC6803 DNA microarray as described in 

25 Examples 7 and 8 below - ' ^ '. 

Hybridization of Microarray Slides ahd Quantitation of Gene Expression 

Microarray glass slides were treated with isopropanol for 10 min, boiling 
double distilled water for 5 min, then treated with blocking buffer (3.5 x SSC, 
0.2% SDS, 1% BSA ) for 20 min at 60°C, rinsed five times with double distilled 

30 water, then twice with isopropanol, followed by drying under nitrogen. Cy3 
labeled cDNA probes prepared from the total RNA of the UV-B treated 
Synechocystis culture, mixed with an equal amount of Cy5 labeled cDNA probes 
prepared from the total RNA of the untreated Synechbcystis culture, were applied 
to the glass slide in a total volume of 30 ^iL. The hybridization was repeated 

35 using Cy5 labeled cDNA probes prepared from total RNA of UV-B treated 

Synechocystis culture mixed with an equal amount of Cy3 labeled cDNA probes 
prepared from the total RNA of the untreated culture, and applied to a second 
glass slide in a total volume of 30 jiL. The hybridization reactions on the glass 
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slides were performed for 16 hr at 42°C, in a humidified chamber. Hybridized 
slides were washed in IX SSC (0.15 M NaCl, 0.015 M sodium citrate), 0.1% SDS 
for 5 min at 42°C; 0.1X SSC, 0.1% SDS for 5 min at 42°C; three washes in 0.1X 
SSC for 2 min at room temperature; rinsed with double distilled water and 
5 isppropanol; and dried under nitrogen. The slides were scanned using a Molecular 
Dynamics laser scanner for imaging of Cy3 and Cy5 labeled cDNA probes. The 
images were analyzed using Array Vision Software (Molecular Dynamics, 
Imaging Research) to obtain fluorescence signal intensities of each spot (each 
ORF on the array) to quantitate gene expression. The ratio between the signals in 

10 the two channels (red:green) is calculated and the relative intensity of Cy5/Cy3 
probes for each spot represents the relative abundance of specific mRNAs in each 
sample. A 
Svnechocvstis Strain and Culture Methods 

* Briefly, Synechocystis sp. PCC6803 cells were grown at 30 jiES" ! nr 2 light 

15 intensity in a minimal growth media, BG-1 1 (Catalog # C-3061, Sigma Chemical 
Co., St. Louis, MO) at 30°C, with shaking at 100 rpm with 5% C0 2 . 
Fifty milliliters of Synechocystis cells grown to mid logarithmic phase (OD 7 3 0nm 
= 0.8 to 1 .0) were divided into two 25 mL cultures and transferred from the 
Erlenmeyer growth flask to two 100 mL petri dishes. The petri dishes, with the 

20 lids on, were placed on a rotary shaker and shaken at 1 00 rpm. 
Cell Treatments 

For the control, the petri dishes comprising the Synechocystis cells were 
placed on a rotary shaker with the lids on, and shaken at 100 rpm. For the UV-B 
treated group, the petri dishes comprising the Synechocystis cells were placed on a 

25 rotary shaker with the lids on, and shaken at 100 rpm. A UV-B lamp (302 nm,) 
was positioned above the petri dishes and the distance between the UV-B light 
source and the petri dishes was adjusted to give the desired level of UV-B light 
intensity. The level of UV-B light intensity was measured at the surface of the 
cell culture using a UV light meter, following the manufacturer's instructions. 

30 UV-B treatment was performed for either 20 min or 120 min. Following UV-B 
irradiation, the cells were immediately cooled on ice and their RNA isolated as 
described below. 

Total RNA Isolation and cDNA Probe Synthesis 

Control-treated Synechocystis cells and UV-B treated Synechocystis cells 
35 were cooled rapidly on ice and centriftiged at.4000 rpm for 5 min. Total RNA 
samples were isolated using Qiagen RNeasy Mini Kit (QiagenV following the 
manufacturer's protocol. RNase A digestion was performed as described in the 
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protocol, and a second round purification was performed using the RNeasy Mini 
Kit. The purified total RNA was analyzed by agarose gel electrophoresis. 

From each total RNA preparation, both Cy3 and Cy5 florescent dye 
labeled cDNA probes were prepared. To synthesize the Cy3 or Cy5 labeled 
5 cDNA probes, a reverse transcription reaction was performed using 10 jig total 
RNA, 12 |ig random hexamer (Ambion), 50 \xM of d ATP, dGTP, dTTP, 25 [iM 
of dCTP, and 15 ^iM Cy3-dCTP or 22 jiM Cy5-dCTP (Amersham Pharmacia 
Biotech), DTT, and AMV reverse transcriptase (Gibco BRL). The reaction was 
carried out at 42°C for 2.5 hr. After the labeling reaction, RNA templates were 

10 degraded by alkaline hydrolysis and the cDNA probes were purified using Qiagen 
PCR purification kit. The purified probes were quantitated by measuring the 
absorbance at 260 nm, 550 nm (Cy5 dye incorporation) and 650 nm (Cy3 dye 
incorporation). Prior to hybridization, 100-200 pmol of the purified Cy3 or Cy5 
labeled cDNA probes were dried under vacuum, and re-dissolved in the 

15 hybridization buffer (5x SSC, 50% formamide, 0. 1 % SDS, and 0.03 mg/mL 
salmon sperm DNA). 

EXAMPLE 7 

Analysis of Svnechocvstis sp. PCC6803 Gene Expression in Minimal Media 
Using a Synechocystis sp. PCC6803 DNA microarray prepared according 

20 to the methods described above and the cDNA probes prepared as described in 
Example 6, Applicants have identified herein promoters that can be employed for 
engineering high levels of gene expression in Synechocystis sp. PCC6803, other 
Synechocystis species, Synechococcus, and like organisms. This Example 
describes the identification of the most highly expressed genes and their 

25 corresponding strong promoters in Synechocystis sp. PCC6803 when grown in 
BG1 1 media containing 5 rnM glucose as described above. 

Specifically, a DNA microarray was prepared according to the methods 
described above using DNA isolated from Synechocystis sp. PCC6803 cells 
grown in BG1 1 media containing 5 mM glucose. Minimal media Synechocystis 

30 sp. PCC6803 gene expression was determined by hybridizing this DNA 

microarray as described above with fluorescent cDNA probes synthesized from 
total RNA isolated from Synechocystis sp. PCC6803 cells grown in BG1 1 media 
containing 5mM glucose as described in Example 6. ' 

Briefly, for each minimal media experiment, two hybridization reactions 

35 were performed as described above. Specifically, the first reaction used equal 
molar (typically 100-200 pmol) of Cy5-labeled cDNA from total RNA of the 
minimal media treated sample, and Cy3 -labeled cDNA probes synthesized from 
Synechocystis sp. PCC6803 genomic DNA; the second reaction used Cy3-labeled 
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cDNA from total RNA of the minimal media treated sample, and Cy5-labeled 
cDNA probes synthesized from Symechocystis sp. PCC6803 genomic DNA, The 
signal intensities were quantitated as described above. To calculate the ratio of 
fold induction (i.e., minimal media/genomic), the minimal media treated sample 
5 signal intensities were divided by the signal intensities of the genomic sample. As 
there were two sets of data .from duplicated spotting within each slide, the total 
number of gene expression measurements for each gene was four. All four 
induction ratios for each gene were analyzed using an Excel program (Microsoft) 
to determine the standard deviation; an indicator of the level of confidence for the 

10 specific data set for each gene. The ratio of signal intensities represents a relative 
transcription level of each gene in the same experiment. Herein, Applicants have 
identified the most highly expressed genes, i.e., those genes that are under the 
control of the strongest promoters, in Synechocystis under this minimal media 
condition (see Table 9). 

15 EXAMPLE 8 

Analysis of Synechocystis sv. PCC6803 Gene Expression Following UV-B 

Exposure 

Using a Synechocystis sp. PCC6803 DNA microarray prepared according 
to the methods described above and the probes prepared as described above in 

20 Example 6, Applicants have identified herein UV-B inducible promoters that can 
be employed for engineering high levels of gene expression in Synechocystis sp, 
PCC6803, other Synechocystis species, Synechococcus, and like organisms. This 
Example describes the identification of the most highly UV-B responsive genes in 
Synechocystis sp. PCC6803 when grown under minimal media conditions and 

25 exposed to 20 minutes of UV-B irradiation at 20 (lES^m* 2 intensity. These UV 
inducible promoters can be used to control expression of certain proteins that may be 
toxic to Synechocystis cells. ' J 'i 

Specifically, a DNA microaiTay was prepared according to the methods 
described above using DNA isolated from Synechocystis sp. PCC6803. For each 

30 UV-B treatment experiment, two hybridization reactions were performed as 
described above.- In particular, the first reaction used equal molar (typically 
100-200 pmol) of Cy5-labelUcDNX from total RNA' of the UV-B treated 
sample, and Cy3-labeled cDNA fromUptal RNA of the control sample 
(Synechocystis sp PCC6803 grovm iri BG1 1 media containing 5 mM glucose); 

35 the second reaction used Cy3 -labeled cDN A from total RNA of the UV-B treated 
sample, and Cy5-labeled cDNA from total RNA of the control sample. The signal 
intensities were quantitated as described above. To calculate the ratio of fold 
induction (i.e., UV-B/control), the UV-B treated sample signal intensities were 

36 
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divided by the signal intensities of the control sample. As there were two sets of 
data from duplicated spotting within each slide, the total number of gene 
expression measurements for each gene was four. All four induction ratios for 
each gene were analyzed using an Excel program (Microsoft) to determine the 
5 standard deviation; an indicator of the level of confidence for the specific data set 
for each gene. 

Applicants have identified herein the most highly UV-B induced genes in 
Synechocystis following UV-B treatment (see Table 10). Only genes whose 
expression was induced more than 4 folds by UV-B light (20 min at 20 \\ES- X mr 2 

10 intensity) as compared'to the mimmal media control are listed in Table 10. The 
promoters of these genes can be used to construct UV inducible expression 
vectors in Synechocystis. 

Some of the gene families induced by UV-B light include Dl protein 
(psbA), phycobilisome degradation proteins (nblA, nblB), carotenoid biosynthesis 

15 enzymes (crtD, crtD, crtQ), chaperones (clpB, ctpA, dnaJ, dnaK, htpG, hspl7), 
RNA polymerase sigma factor (rpoD), superoxide dismutase (sodB), high light 
inducible protein (hliA), FtsH protease, which is responsible for the degradation 
of photo-damaged Dl protein (ftsH), and DNA repair enzyme (uvrC). Among the 
group of UV inducible genes, there are several genes of unknown function: 

20 ssr2016, and sll0185. Applicants' discovery has lead to the first level of 

functional assignment for these genes. The promoters of these genes can be used 
to construct UV inducible expression vectors in Synechocystis: * 

A subgroup of Applicants 7 identified UV-B induced genes comprise two 
Escherichia cW/-like -35 promoter sequences in the 5'; upstream untranslated 

25 regions (UTR), including sliri 604 (ftsH), slr0228 (ftsH), si! 1 867 (psbA3), sir 1 3 1 1 
(psbA2), ssl0452 (nblA), ssl0453 (nblA); ssl2542 (hliA), ssr20 16 (unknown 
protein with homologues in green algae and plant), and sll0185 (unknown 
protein). The nucleotide sequence "GTTACA" is present in the 5* untranslated 
regions of psbA2, psbA3, and ssr201<5, nucleic acids? * The nucleotide sequence 

30 ,, TTTACA ,, was also found to be present in the 5* UTR regions of psbA2, psbA3, 
ssr2016, rpoD, and ndhD2 nucleic acids. 
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LISTING OF TABLES 



Table 1 



gene 

lacA 

lacZ 
lacY 

b2324 
uxaA 

bl783/ 

yeaG 

melA 

b0956/ 

ycbG 



function 
thiogalactoside 
acyltransferase 
galactosidase 
galactoside 
permease 
peptidase? 
altronate 
hydrolase 



galactosidase 
hydrogenase? 



LB 

fold 1PTG 
induction 

36 

29 
14 

5.3 
4.0 



2.9 
2.5 



fn a 

4.00E-05 

8.88E-05 
6.07E-05 

2.78E-04 
2.93E-04 



3.6 3.7IE-04 



4.05E-05 
2.63 E-04 



MM exp. phase 

rank b fn rank 

3747 2.46E-07 4244 

2420 2.16E-05 3879 

3125 6.10E-06 4202 

621 7.30E-05 2639 

575 7.82E-05 2530 



MM transition 
phase 



fn 

2.09E-05 

1.98E-05 
1.63E-05 

5.43E-05 
9.03E-05 



401 3.23E-04 576 I.04E-03 

3729 1.36E-05 4050 1.65E-05 
678 1.41 E-04 1573 1.27E-04 



rank 
3816 

3849 
3975 

2717 
1990 

136 

3966 
1529 



10 



a = fraction of particular transcript/summed transcripts hybridizing to all open reading frames on the micro-arrays; 

° = genes are ranked in order of expression with I being the most highly expressed gene 

MM: Minimal media, 

exp phase: exponential growth phase 



Table 2. Highly Expressed Genes under Three Different Culture Conditions 



fraction 3 in LB 





0.0070 


rplK 


0.0068 


rnlk 


0.0066 


rp!d 


0.0048 


hemK 


0.0047 


rpml 


0.0046 


rplW 


0.0044 


rpk! 


0.0043 


acpP 


0.0042 


Ipp 


0.0040 


gW 


(: 0.0039 


fusA 


0.0039 


gotB 


0.0038 


rpsF 


0.0038 


tufB 


0.0037 


ompC 


0.0037 


mopB 


0.0035 


atpF 


0.0035 


hns 


0.0035 


rpmB 


0.0034 



name 


fraction in 


name 


fraction in 




minimal (exp. 




minimal 




phase) 




(transition) 


cspA 


0.0054 


hdeA 


0.016 


metE 


0.0050 


hdeB 


0.0099 


tu/B_ 


0.0048 


rmf 


0.0083 


ompA 


0.0046 


dps 


0.0065 


HvC 


0.0042 


Ipp 


0.0063 


rmf 


0.0038 


ompC 


0.0059 




0.0038 


IcdA 


0.0059 


ompT 


0.0037 


metE 


0.0053 




0.0036 


gapA 


0.0049 


ahpC 


0.0034 


tHfA 


0.0049 


rp!M> ■?. 


, 0.0031 , 


ompA 


0.0044 




' p.0031 ( yv|H^ 




0.0044 


aceB , 


0.0030 


uspA 


0.0040 




0.0029 ' ; : 




0.0039 


rpsJ 


0.0028 


iivC 


0.0037 


cirA s 


0.0028 


rpsN 


0.0036 


gap A 


0.0026 


eno 


0.0036 


rpml 


0.00266 


ahpC 


0.0035 


yjjs 


0.0026 


ompT 


0.0033 


rpmC 


0.0024 


zadA 


0.0033 
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otnpA 


0.0033 


fiisA. 


0 0094 


aceB 


0.0033 


tnctA 


0.0033 


1)2745 


0 0024 


rrtlY 


U.UUji 


fpoA 


0.0033 


OftipF 


0 0091 


JusA 


0.0032 


trmD 


0.0032 


rvnC 


0 0099 


n i.rr 

pisti 


A AA*5 t 

0.003 1 


rpll 


0 0031 


accA 


0 009 1 


rj)sD 


0.0029 


van A 


0 00^0 


pyrB 


0 009 1 
U.UUZ j 


bo 5 12 


0.0029 


rplM 


o on? s 


KPJJS 


U.UUZ I 


gpmA 


0.0028 


rpntG 


0 0098 


rpsD 


U.UUz 1 


metK 


0.0028 






C Y S K 


A A AO A 

U.UUzU 


rpmC 


0.0027 


rnIT 


n 0097 


pisi 


A A AO A 

U.UUzU 


zadB 


0.0027 


rnlX 






A AA I A 

u.uuiy 


rpsV 


0.0027 


priB 


U.VUZO 


rpsS 


A AA 1 A 

U.UU 1 9 


cysK 


0.0026 


OTYlpf* 


n 009^ 


JepA 


A AA 1 A 


rpsJ 
rpsH 


0.0025 


hupA 


0 009^ 


bxlL 


U.UU 1 o 


0.0025 


ypsJ 


o 009 s 


atoF 


A aa t o 
U.UU I K 


rpIE 


0.0025 


rplB 


n 009 s 


rrtP LJ 


A AA 1 *? 

U.UU I / 


aceA 


0.0025 


rplU 


0 009 S 


ypsN 


A AA 1 T 

U.UU 1 / 


02266 


0.0023 


fie 
=s 


0.0024 


OUoUJ 


a Via i i 
U.UUI / 


rjpJM 


0.0023 


tu/A 


0.0024 


otnpC 


'0 AA 17. 


rpsS 


0.0023 


rplD 


0 0094 


rpsA 


A A A 1 T , 
U.UUI / 


nlpu 


0.0022 


rplC 


0 0094 


tUrl 


A A A 1 *T t* 

U.UUI / ^ ' 


acpP 


0.0022 


PQtA 


0 0024 


rpiA. 


A AA1£ 1 
U.UU 10 


rpml 


0.0022 


rpsA 


0.0024 


rn/f ' 


A AA1 a 
U.UU 10 


rpgS ■ 


0.0021 




0 0O91 


vpsM 


A AA1 A. 
U.UU 10 


r/?o^ 


0.0020 


rpsS 


0.0023 


rvt i TO 


0 00 16 
U.UU. 1 0 


/wis 


A AA*>A 

0.0020 


ppa 


0.0022 


KPJB t \.\ 


0.0016 




; 0.0020 


gatZ 


0.0022 


w0793 , 


0.0016 


rplB 


j 0.0020 


cspE 


0.0021 


m 


0.0015 


bl452 


0.0019 


cspC 


0.0021 


icdA 


0.0015 


b0817 


0.0019 


mopA 


0.0021 




0.0015 


bf003 


0.0019 



fraction of transcripts hybridizing to specified gene/summed transcripts hybridizing to all open 
reading frames on the micro-arrays 

bold, double underlined -foldase/usher genes; bold, underlined -stress responsive genes; bold - 
central metabolic enzyme-specifying genes; double underlined -biosynthetic genes; dotted underlined - 
translation-associated genes; underlined -rpoS controlled genes 



Table 3. Summary of three E. coii Expression Profiles 



1. Cell processes 
Cell division -26 c 
Chemotaxis. motility 

Chemotaxis and mobility- 12 
Folding and ushering proteins -7 
Transport of large molecules 

Protein, peptide secretion-32 
Transport of small molecules 

Amino acids, amines-49 

Anions-20 



fraction in fraction in fraction in 

MM a /exp. b MM/ LB/exp. 

phase transition phase 

phase 



0.011 

0.0014 
0.0032 

0.0082 

0.0091 
0.0029 ( 



0.010 

0.00068 
0.0061 

0.01014 

0.0081 
0.0028 



0.010 

0.0011 
0.011 

0.010 

0.0068 
0.0023 



*39 
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Carbohydrates, organic acids, 

alcohols-82 
Cations-52 

Nucleosides, purines, pyrimidines-6 
Other-12 

2. Elements of externa! origin: 

Laterally acquired elements 
Colicin-related 
functions-5 

Phage-related functions and prophages-27 
PI asm id-related functions- 1 
Transposon-related functions-34 

3. Global functions 

Energy transfer, ATP-proton motive force -9 
Global regulatory functions -S 1 

4. Macro molecule metabolism 

Basic proteins 
Basic proteins - synthesis, modification-6 

Macromolecule degradation . t 
Degradation of DNA-23 ' ; f t 
Degradation of RNA- 11 ' , 

Degradation of polysaccharides-3 j 
Degradation of proteins, peptides, glyco-61 

Macromolecule synthesis, modification 

DNA - replication, repair, restr./modiflc'n-89 
Lipoprotein- 11 
Phospholipids- 1 1 
polysaccharides - (cytoplasmic)-6 
proteins - translation and modiflcation-34 
RNA synthesis, modification, DNA transcript'n-27 

Macromolecules 

Glycoprotein ; 
Lipopolysaccharide-13 ■ 

aa-tRNAs < i 

Amino acyl tRNA syn; tRNA modific'n-40 

5. Metabolism of small 
molecules 

Amino acid biosynthesis 
Biosynthesis of cofactors, 
carriers 

Central intermediary metabolism 

2'-DeoxyribonucIeotide metabolism- 1 2 
Amino sugars- 10 
Entner-Douderoff-3 

Gluconeogenesis-4 ''' ' 

Glyoxylate bypass-5 ;■ ^ 

Misc. glucose metaboIism-3 ^ 

Non-oxidative branch, pentose pwy-8 
Nucleotide hydrolysis-2 1 ' ■ 
Nucleotide interconversions-13 

Phosphorus compounds- 17 

Polyamine biosynthesis-8 

Salvage of nucleosides and nucleotides- 1 8 

Sugar-nucleotide biosynthesis, conversions- 18 

40 
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0.020 0.016 0.034 

0.012 0.0098 0.0076 

0.0010 0.00090 0.0017 

0.0021 0.0027 0.0012 

0.024 0.017 0.023 



0.0055 0.0042 0.0065 

0.00017 0.00055 0.00086 

0.0058 0.0035 0.0038 

0.0077 0.0054 0.015 

0.0176 0.029 0.018 



0.0047 0.0048 0.0074 

0.0038 < 0.0030 . 0.0031 

0.0029 0.0015 0.0022 

0.00056 0.00033 0.00040 

0.00842 ' 0.0093 0.011 



0.023 


0.019 


0.031 


0.0041 


0.0050 


0.0037 


0.0020 


0.0015 


0.0021 


0.0015 


0.0016 


0.00060 


0.029 


0.030 


0.043 


0.010 


0.010 


0.015 


.0015 


0.0012 


0.0018 


0.013 


0.013 


0.021 


0.012 


0.0093 


0.0033 


0.072 


0.069 


0.064 


0.0034 


0.0032 


0.0032 


0.0012 


0.0011 


0.0015 


0.00040 


0.00034 


0.00060 


0.00086 


0.0012 


0.0021 


0.0076 


0.0075 


0.0012 


0.00085 


0.00050 


0.00039 


0.0026 


0.0043 


0.0043 


0.00010 


0.000 11 


0.00027 


0.0041 


0.0039 


0.002 


0.0032 


0.0030 


0.0022 


0.0016 


0.0013 


0.0013 


0.0037 


0.0038 


0.0054 


0.0042 


0.0034 


0.0048 
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Sulfur metabolism- 1 0 

Pool, multipurpose conversions of intermed. Mef- 

46 

Degradation of small molecules 
Amines-9 
Amino acids- 17 
Carbon compounds-90 

Fatty acids- 10 

Other-8 
Energy metabolism, carbon 
Aerobic respiration-27 
Anaerobic respiration- 80 
Electron transport-24 
Fermentation-2 1 
Glycolysis- 18 

Oxidative branch, pentose pwy-2 

Pyruvate dehydrogenase-6 

TCA cycle- 18 
Fatty acid biosynthesis 

Fatty acid and phosphatidic acid biosynthesis-23 
Nucleotide synthesis 

Purine ribonucleotide 
biosynthesis-22 

Pyrimidine ribonucleotide 
biosynthesis- 10 

6. Miscellaneous 

Not classified - 109 

7. Open reading frames , 
Unknown proteins - 1 324 

8. Processes 

Adaptation 

Adaptations, atypical conditions- 1 6 

Osmotic adaptation- 14 
Protection responses 

Cell killing-3 

Detoxification- 1 1 

Drug/analog sensitivity-32 

9. Structural elements 

Cell envelope 

Inner membrane-4 

Murein sacculus, peptidoglycan-34 

Outer membrane * 
constituents- 1 7 

Cell exterior constituents- 16 

Surface polysaccharides & antigens 

Surface structures- 5 7 
Ribosome constituents , 

Ribosomal and stable RNAs-3 

Ribosomal proteins - synthesis, modificationRiboso- 

54 

Ribosomes - maturation and modification-6 

10. ORFs not listed- 102 



0.0039 0.0025 0.00095 

0.012 0.019 0.021 



0.0016 
0.0022 
0.014 
0.0020 
0.0015 


0.0010 
0.0014 
0.011 
0.0017 
0.0021 


0.0025 

0.0072 

0.025 

0.0030 

0.00075 


0.0077 
0.0075 
0.0032 
0.0040 
0.013 


0.0058 
0.0057 
0.0024 
0.0050 
0.024 


0.012 

0.011 

0.0048 

0.0044 

0.015 


0.0046 
0.0089 


0.0041 
0.012 


0.0040 
0.0093 


0.0073 
0.019 
0.011 


0.0094 
0.013 
0.0077 


0.015 
0.010 
0.0083 


0.0079 


0.0049 


0.0018 


0.37 
0.022 


0.37 : 
0.023 


0.28 
0.025 


0.35 


0.34 


0.26 


0.012 
0.0038 


0.0083 
0.0063 


0.0074 
0.0026 


0.00052 

0.0080 

0.0042 

i i 


0.00031 

0.0083 

0.0031 


0.00031 
0.0097 
0.0038 


0.0095 
0.023 


0.012 
0.026 


0.013 
0.020 


0.0037 


0.0039 


0.0062 


0.0075 


0.0051 


0.0052 


0.079 


0.086 


0.15 


0.0056 


0.011 


0.00066 
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a MM =MinimaI medium, ^exp. = exponential, c the number following each description is the number 
of genes summed 

Table 4. Gene expression elevated by the presence of a sdiA multi-copy plasmid. 



Genes (grouping by function) 


Fold induction 


Genes (grouping by function) 


Fold induction 


1. Cell processes 




8. Not classified 




Cell division 




agal 


3.3 


fisA 


10. 


chpA 


3.0 


ftsQ 


8.8 


dinl 


2.0 


ftsZ 


11 


dinP 




mtnC 


2.1 


envR 


2.2 


minD 


2.7 


ppdB 


2.9 


minE 


2.4 


sohA 


4.9 


sdiA 


30. 


sugE 


2.2 


sulA 


2.6 


uvrY 


11.9 


Chemotaxis and motility 




9. Open reading frames of 








unknown functions 




1 ransport ot large molecules 




apaG 


2.0 


Protein, peptide secretion 




hdeB 


2.4 


msyo 


2.1 


relE 


2.1 


oppA 


2.1 


sprT 


3.8 


sapB 


2.2 


b0065 


3.7 


secD 


2.5 


b0097 


2.1 


secF 


2.4 


b0135 


6.4 


Transport of small molecules 




b0137 


2.5 


Amino acids, amines 




bOJ38 


2.1 


glnH 


4.0 


bONI 


4.6 




2.5 


b0!63 


2.8 


Carbohydrates, organic acids, ale 




b0189 


2.3 


araE 


5.0 


b0224 


2.1 


frvA 


3.2 


b0225 


6.4 


frwD 


2.1 


b0232 ' 


3.1 


gntU-1 


2.0 


b0233 


2.9 


srlB • 


*> i 
2.1 


b0234 


3.1 


t xvlF 


J.O 


OUZ*tJ 


Z.J 


Cations 




b0269 


3.2 


• bfr 


2.0 


b0281 


2.4 


chaA 


2.0 


bQ295 


2.8 


feoA 


5.8 


b0300 


2.0 


fepD 


2.1 


b0303 


4.7 


trkG 


4.1 


b0322 


2.5 


2. Elements of external origin: 




b0404 


3.0 


Transposon-related functions 




b0407 


2.9 


\rhsC> 

■ t if ( . 


6.3 


b0412 


2.4 
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3. Global regulatory functions 




b0443 


22 


Ion 


2.4 


b0461 


2.1 


Irp 


2.2 


b0498 


2.3 


lytB? 


3.0 


b05!7 


11.2 


rpoE 


2.4 


b05J9 


2.5 


rseA 


2.5 


b0530 


6.5 


rseB 


2.2 


b0534 


2.6 


4. Macromolecule metabolism 




b0567 


2.2 


Degradation of proteins, peptides 




b0625 


3.5 


htrA 


2.3 


b0710 


5.2 


hycl 


2.8 


bQlil 


4.7 


ptr 


2.1 


b0712 


6.4 


Degradation of DNA 




b07I3 


3.8 


endA 


2.2 


b0715 


2.2 


mcrB 


2.1 


b0767 


3.3 


mcrC 


3.5 


b!023 


2.7 


recD 


2.4 


bl024 


2.1 


uvrC 


9.3 


b!069 


3.8 


Macromolecule synthesis, modifl 




bll13 


3.8 


DNA - replication, repair, 




bJ2!4 


2.3 


gidA 


4.1 


b!321 


2.1 


gidB 


2.3 


b!438 


11 


hupB 


4.5 


bl451 


2.0 


mioC 


7.0 


b!454 


2.2 


mutH 


2.2 


bI455 


6.4 


nei 


8.6. 


bl458 


2.3 


priC 


2.8 


bJ463 


2.1 


recN 


3.6 


bl487 


3.0 


umuC 


2.3 


bl49I 


2.3 


uvrA 


2.0 


b!498 


3.7 


xerD 


2.3 


bl499 


2.6 


Lipoprotein 




bl504 


2.1 


blc 


2.8 


bl540 


2.5 


nlpC 


3.2 


b!54J 


6.2 


vacJ 


2.1 


b!542 


3.2 


Phospholipids 




bl543 


2.4 


pgsA 


2.4 


bl544 


3.0 


polysaccharides - (cytoplasmic) 




bl545 


4.4 




2.1 


b!547 


2.2 




2.1 


bl551 


2.1 


proteins - translation and modific 




bl560 


3.8 


prJH 


2.2 


bl565 


3.4 


Lipopolysaccharide 




b!567 


3.5 


rfaK 


2.3. 


b!568 


2.0 


rfaL 


2.1 


bl579 


2.6 


rfaY 


2.1 


bI586 


2.2 
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rfaZ 


2.5 


bl601 


4.0 


5. Metabolism of small molecule 




b!606 


8.0 


Amino acids 




b/607 


7.7 


argA 


3.4 


b/624 


2.1 


aroD 


2.6 


bI627 


2.4 


glnA 


2.5 


b!628 


3.0 


glnD 


2.6 


b!632 


2.1 


lysR 


3.6 


b!648 


2.6 


Biosynthesis of cofactors, carrier 




bI649 


2.7 


thiD 


2.1 


bJ657 


4.7 


thiM 


2.2 


b!664 


2.1 


gst 


3.1 


b!673 


2.4 


Central intermediary metabolism 




bl688 


2.6 


2'-Deoxyribonucleotide metaboli 




b!699 


2.0 


nrdA 


2.1 


bl700 


3.7 


Amino sugars 




bl701 


2.0 


agaD 


4.5 


b!706 


2.4 


Gluconeogenesis 




bJ707 


30 


ppsA 


2.4 


bI72J 


6.5 


Phosphorus compounds 




b!724 


2.1 


psiF 


3.2 


bJ743 


2.2 


Polyamine biosynthesis 




bI744 


2.5 


speC 


9.5 


bl746 


2.9 


Salvage of nucleosides and nude 




b!756 


3.2 


apt 


2.4 


b!789 


3.6 


gsk 


2.4 


b1847 


2.3 


Pool, multipurpose conversions o 




bi848 


2.7 


galM 


2.3 


b!870 


2.4 


gcvA 


4.6 


b1871 


2.8 


glnK 


2.1 


bl875 


3.4 


pntA 


10.4 


b!877 


2.4 


pntB 


8.2 


bl935 


2.1 


Degradation of small molecules 




b!953 


3.5 


Amino acids 


' * 


bJ955 


5.1 


tdcB 


2.0 V 


b!956 


14 


tdcR 




b!965 


5.2 


Carbon compounds 




b!967 


6.6 


fucA 


2.8 


bJ968 


2.4 


fucU 


14 


b2006 


4.4 


galE 


3.8 


b2007 


2.1 


galK 


4.1 


b2015 


3.6 


galT 


4.9 


b2016 


3.5 


gicD 


2.0 


b2017 


3.8 


gusR (uidR) 


8.0 


b206I 


2.2 


lacA 


3-7 


b2071 


3.1 


lad 


2.5 


b2J45 


2.7 
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uxuC 




b2190 


2.3 


Fatty acids 




b2229 


2.1 


atoD 


3.8. 


b2247 


2.3 


Energy metabolism, carbon 




b2253 


2.3 


Aerobic respiration 




b2267 


2,0 


nuoH 


2.1 


b2268 


2.0 


nuol 


2.1 


b2269 


2.6 


Anaerobic respiration 




b2270 


2.2 


dniR 


5.8 


b2301 


3.8 


hybD 


3.8 


b2302 


7.5 


hybE 


12 


b2339 


2.0 


hybF 


6.6 


b2352 


2.6 


hycA 


3.1 


b2356 


2.3 


hycG 


2.7 


b2385 


2.1 


hycH 


3.8 


b2387 


3.6 


hydN 


13 


b2419 


3.2 


hypC 


2.3 


b2420 


3.2 


nrjB 


2.7 


b2439 


2.1 


nrJG 


2.1 


b2443. 


2.7 


Electron transport 1 




52444 


2.1 


appB 


2.5. 


52445 


2.7 


cybC 


2.1 


b2485 


2.2 


Pyruvate dehydrogenase 




b2505 


2.1 


pdhR 


4.4 


52597 


2.8 


TCA cycle 




52628 


2.2 


fumC 


5.8 


52629 


3.4 


sucA 


2.1 


52631 


2.9 


sucB 


2.6 


52632 




sucC 


2.2 •/*• 


52640 


5.0 


sucD 


2.7 


52641 


2.3 


Fatty acid and phosphatidic acid 




52642 


15 


aas 


2.0 


52643 


2.6 


cdh 


2.3 


52648 


3.7 


Purine ribonucleotide biosynthes 




52649 


3 0 


purE 


2.1 


52756 


2.4 


purR 


2.3 


52767 


2.9 


Pyrimidine ribonucleotide 




52833 


2,1 


pyrL 


2.1 


52845 


2.6 


6. Processes 




52846 


3.0 


Detoxification 




52851 


2.6 


cutC 


2.1 


52862 


2.4 


Drug/analog sensitivity 




b2874 


3.3 


acrA 


6.8 


52912 


2.6 


acrD 


3.0 


52931 


2.2 


acrE 


14 


52984 


2.1 


acrF 


6.3 


53021 


2.4 
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acrR 


4.5 


b3022 


2.7 


ampC 


2.6 


b3047 


2.2 


arsC 


2.1 


b3050 


2.2 


toiC 


2.6 


b3!30 


2.7 


7. Structural elements 




b3l42 




Cell envelope 




b3254 


3.8 


Inner membrane 




b3372 


2.2 


stnpA 


2.6 


b3379 


2.0 


Murein sacculus, peptidoglycan 




b3395 


4.6 


ddlB 


4.6 


b3397 


3.5 


hipB 


2.0 


b3398 


2.2 


mreD 


2.0 


b3441 


4.0 


Outer membrane constituents 




b3465 


2.1 


sip 


2.5 


b3467 


2 5 


Cell exterior constituents 




b3487 


2.1 


kdsA 


2.3 


b3494 


2.2 


IpxC 


3.4 


b3513 


5.3 


rfaB? 


2.6 


b3535 


2.2 


Ribosome constituents 




b3536 


2.9 


Ribosomal proteins - synthesis, 




b3548 


2.0 


rpsL 


3.1 


b3615 
b3697 
b371I 
b3712 
b3713 
b3714 
b37J9 
b3720 
b3776 
b3820 
b3888 
b3937 

b3944 

■ ■ * 

b3964 

i. ( f 


2,0 
2.9 
2.3 
2.1 
2.1 
2.2 
2.5 
3.0 
2.9 
2.5 
3.4 
2.1 
2.0 
2.2 






b4038 
b4068 


2.4 






2.5 




i' * 


't s 1 

b414I 
b4)56 


2.3 






2.2 


< 




b4I9I 


5.1 






b422\ 


5.0 






b4222 


*T.O 






b4234 


2.9 






b4248 


2.9 






b4282 


2.1 






b4298 


2.3 






b4300 


2.2 
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b4325 


2.1 


b2088 


9.4 


14404 


5.9 


b4405 


4.2 


yieD (b3695) 


3.8 


b3914 


3.5 


b39!3 


3.3 


viiT(b4352} 


2.8 


yjiV{b4342) 


2.7 


yjjQ{b4364) 


2.4 


yhiL (b3486) 


2.3 


b2848 


2.0 


b3573 


2.0 



Table 5. Gene expression reduced by the presence of a sdiA multi-copy plasmid. 



Genes (grouping by function) Fold repression Genes (grouping by function) Fold repression 

1 . Cell processes 6. Structural elements 

Cell division Outer membrane constituents 



fisX 


2.4 


flu 


1.1 


Taxis and mobility 




Cell exterior constituents 




air (aer) 


4.6 


nanA 


3.7 


cheA 


3.7 


Surface structures 




cheB 


3.3 


flmA 


2.6 


cheR 


2.2 


flgA 


2.3 


cheW 


5.3 


JfgB 


14 


cheY 


4.6 


flgC 


17 


cheZ 


4.0 


feD 


17 


motA 


2.9 


flgE 


17 


motB 


2.9 


flgF 


7.1 


tar 


5.3 


flgG 


13 


tsr 


5.9 


flgH 


5.9 


Transport 




flgl 


5.6 


Protein, peptide secretion 




flgJ 


6.3 


dppA 


2.3 


flgK 


6.3 


Amino acids, amines 




flgL 


11 


sdaC 


3.9 


flgM 


7.1 


Carbohydrates, organic acids 




flgN 


5.6 


alcohols 








fadL 


2.5 


flhA 


3.1 
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glpF 


3.6 


fliA 


10. 


glpT 


2.5 


fliC 


14. 


lamB 


3.0 


fliD 


4.6 


malE 


3.7 


JliE 


5.9 


mglA 


2.7 


JliF 


11 


rbsA 


2.6 


fliG 


9.1 


treB 


5.3 


fliH 


5.0 


Cations 




jlil 


3.9 


fecA 


3.5 


flU 


7.1 


fecB 


2.7 


fliK 


4.8 


fecE 


3.2 


flit 


7.7 


ftn 


2.6 


JliM 


13 


kdpA 


3.5 


fliN 


5.0 


2. Elements of external origin: 




JliO 


4.4 


Phage-related functions and prophages 


fliP 


6.3 


lar 


4.0 


fliR 


3.1 


nmpC 


2.1 


JliS 


5.9 


3. Global regulatory functions 




JliT 


5.6 


cytR 


2.2 


fliZ . 


8.3 


4. Degradation of macromolec 




Ribosomes - maturation and modifu 


DNA 




giitM 


2.1 


xseB 


4.8 


7. Not classified ; 




Proteins, peptides 




fir 


2.1 


pepE 


2.6 


8. Open reading frames with unknot 


5. Metabolism of small molecu 




b0105 


2.4 


Pool, multipurpose conversions 


of intermed. Met* 


50235 


2.1 


glpK 


3.5 


b0290 


3.5 


gW 


2.3 


b0307 


2.4 


gltF 


2.6 


bQ704 


2.7 


Degradation of small molecule 




b0732 


2.0 


Amino acids 




bllOO 


2.5 


sdaB 


3.2 


bll94 


5.0 


tnaA 


4.0 


bllQQ 


2.9 


tnaL 


2.4 


bJ329 


2.2 


Carbon compounds 




bl339 


2.2 


fucR 


2.8 


bJ383 


2.4 


hcZ 


6.7 


bl520 


2.7 


malM 


3.0 


b!566 


4.4 
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ma IT 


2.6 


hi 690 


2 ! 


treC 


2.6 


bl722 


2 2 


Energy metabolism, carbon 




bl880 


3.6 


Anaerobic respiration 




bJ929 


2.1 




2.2 


hi 930 




Fermentation 




b2001 


9 7 


ackA 


3.1 


0£\J\ij 




aldA 


3.3 


U£vl *r 


9 A 


pta 


2.3 


b2537 


2 7 


Fatty acid and phosphatidic acid biosynth 


b2844 


2 7 


accD 


3.0 


b30I0 


9 9 


Purine ribonucleotide biosynth 




b3111 


2 6 


ndk 


2.2 


b3323 


2.9 






b3442 


2.5 






b3539 


9 4 






b3872 


2.1 






yjiZ(b4354) 


3.3 






yjbP^ (b3877) 


4.0 






yhjH(b3524) 


4.8 
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Table 6. Gene expression profiles of MG 1655 strain when exposed to MMC 



m15 


control 


final 


Gene name 


M40 


control 


total 


Gene 






ratio 








ratio 


name 


25115.41 


4700.39 


5.34 


recN 


98536.33 


11895.47 


8.28 


recN 


34003.66 


4565.91 


7.45 


hsIS 


11287.86 


1677.72 


6.73 


hsIS 


1146.46 


769.58 


1.49 


din! 


10515.16 


1564.72 


6.72 


dinl 


2516.14 


1468.51 


1.71 


sulA 


13936.20 


2416.81 


5.77 


sulA 


4531.55 


2030.86 


2.23 


W1816 


18614.34 


3231.00 


5.76 


W1816 


34497.54 


4438.78 


7.77 


hsIT 


21578.60 


5358.30 


4.03 


hsIT 


8490.60 


4896.22 


1.73 


lexA 


22768.75 


6375.09 


3.57 


lexA 


992.44 


1322.83 


0.75 


W0522 


8750.63 


2628.39 


3.33 


W0522 


3207.65 


2219.58 


1.45 


w1815 


16206.30 


4874.54 


3.32 


w1815 


1844.00 


1864.81 


0.99 


W2141 


2156.14 


691.51 


3.12 


W2141 


21169.76 


11055.79 


1.91 


recA 


34375.81 


11386.61 


3.02 


recA 


756.88 


510.08 


1.48 


smpA 


1614.58 


558.50 


2.89 


smpA 


17140.20 


23337.13 


0.73 


cspA 


12566.23 


4347.87 


2.89 


cspA 


165.71 


396.95 


0.42 


ecpD 


227.53 


80.37 


2.83 


ecpD 


780.07 


941.22 


0.83 


W3019 


802.62 


288.87 


2.78 


W3019 


821.35 


733.39 


1.12 


W2878 


1151.78 


439.75 


2.62 


W2878 


301.47 


384.64 


0.78 


entD 


491.14 


189.39 


2.59 


entD 


385.41 


778.62 


0.49 


' fhuC 


818.05 


321.73 


2.54 


fhuC 


7203.63 


6899.86 


1.04 


W1201 


4623.01 


1874.23 


2.47 


W1201 


1476.06 


1291.17 


1.14 


W2999 


1710.57 


709.13 


2.41 


w2999 


269.15 


393.30 


0.68 


caiB 


326.51 


139.49 


2.34 


caiB 


5327:31 


6525.35 


0.82 


infA 


8186.35 


3504.89 


2.34 


infA 


9150.34 


6624.37 


1.38 


uvrA 


33530.31 


14452.89 


2.32 


uvrA 


1657.22 


1816.09 


0.91 


w2879 


3946.52 


1727.30 


2.28 


W2879 


4322.77 


5547.40 


0.78 


insB_2 


6522.25 


2894.35 


2.25 


insB_2 


2310.26 


1778.19 


1.30 


dinD 


5316.67 


2385.63 


2.23 


dinD 


5349.03 


4945.79 


1.08 


secG 


6754.21 


3076.60 


2.20 


secG 


136.50 


367.57 


0.37 


priC 


341.80 


156.11 


2.19 


priC 


617.58 


603.54 


1.02 


W0561 


11989.75 


5479.59 


2.19 


W0561 


2228.66 


2966.21 


0.75 


exbD 


7165.39 


3289.27 


2.18 


exbD 


1282.58 


893.81 


1.43 


umuC 


5697.94 


2659.80 


2.14 


umuC 


6703.55 


7422.39 


0.90 


mioC 


8113.11 


3804.25 


2.13 


mioC 


3289.24 


4228.67 


0.78 


insB 1 


6065.02 


2854.20 


2.12 


insB_1 


4042.60 


3531.25 


1.14 


trkH 


17795.29 


8430.83 


2.11 


trkH 


867.30 


1494.75 


0.58 


w1345 


1026.33 


487.58 


2.10 


W1345 


541.83 


848.15 


0.64 


dniR 


1878.65 


899.98 


2.09 


dniR 


5469.65 


4392.09 


1.25 


uvrB 


14508.65 


6960.78 


2.08 


uvrB 


I \J\J I .UO 


£. 1 OO. 1 1 


U. / /, 


insA_4 


oono oo 


A A A A ^ t~\ 

1111.79 


2.07 


insA_4 


5398.87 


3786.47 


1.43 


ruvA 


10492.52 


5134.38 


2.04 


ruvA 


343.85 


654.22 


0.53 


appY 


815.11 


400.28 


2.04 


appY 


18257.00 


17197.04 


1.06 


xseA 


13206.64 


6494.62 


2.03 


xseA 


1863.91 


1771.47 


1.05 


w0224 


6686.92 


3310.73 


2.02 


W0224 


5595.42 


6241.55 


0.90 


W3139 


11174.79 


5555.32 


2.01 


W3139 


1656.98 


1560.47 


1.06 


W2512 


24511.01 


12207.10 


2.01 


W2512 


349.40 


648.41 


0.54 


W3304 


428.93 


850.84 


0.50 


W3304 


297.42 


326.64 


0.91 


w2228 


271.97 


539.98 


0.50 


W2228 


226.05 


501.68 


0.45 


chaB 


721.77 


1433.44 


0.50 


chaB 


678.56 


880.61 


0.77 


cydA 


699.94 


1392.46 


0.50 


cydA 


1422.74 


2311.13 


0.62 


meIR 


5081.98 


10140.17 


0.50 


meIR 


1051.64 


763.06 


1.38 


W1004 


1265.66 


2528.73 


0.50 


w1004 


386.58 


562.93 


0.69 


hofG 


315.25 


630.05 


0.50 


hofG 


513.59 


611.47 


0.84 


w1429 


360.81 


721.16 


0.50 


W1429 


1256.86 


1688.60 


0.74 


w0299 


9464.02 


18943.08 


0.50 


w0299 
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m15 


control 


final 


Gene name 


M40 


control 


total 


Gene 






ratio 








ratio 


name 


695.83 


807.90 


0.86 


rpiR 


748.66 


1502.59 


0.50 


rpiR 


904.58 


1028.97 


0.88 


celD 


2009.30 


4041.66 


0.50 


celD 


2139.22 


1712.26 


1.25 


w0801 


2801.71 


5636.83 


0.50 


W0801 


749.35 


812.05 


0.92 


W0241 


712.18 


1434.35 


0.50 


w0241 


827.51 


522.31 


1.58 


W0621 


1293.49 


2605.36 


0.50 


W0621 


3746.49 


3442.41 


1.09 


putP 


12834.08 


25859.69 


0.50 


putP 


428.60 


231.66 


1.85 


W4099 


377.87 


764.43 


0.49 


W4099 


197.66 


171.90 


1.15 


prsA 


535.40 


1091.60 


0.49 


prsA 


158.38 


114.75 


1.38 


hybD 


177.80 


363.82 


0.49 


hybD 


83.33 


438.61 


0.19 


sapB 


501.32 


1027.11 


0.49 


sapB 


334.88 


643.56 


0.52 


W3821 


412.40 


845.78 


0.49 


W3821 


388.49 


568.82 


0.68 


W1459 


417.69 


857.67 


0.49 


W1459 


275.47 


170.99 


1.61 


agaD 


222.68 


459.00 


0.49 


agaD 


309.42 


402.34 


0.77 


ccmD 


646.05 


1331.95 


0.49 


ccmD 


328.75 


360.49 


0.91 


cpsG 


1070.07 


2212.14 


0.48 


cpsG 


421.02 


702.69 


0.60 


relB 


1002.28 


2072.50 


0.48 


relB 


1361.48 


1629.16 


0.84 


w2809 


3556.91 


7365.32 


0.48 


w2809 


924.92 


626.14 


1.48 


w0824 


1242.77 


2589.98 


0.48 


W0824 


794.28 


965.82 


0.82 


osmE 


2901.61 


6072.91 


0.48 


osmE 


277.27 


66.53 


4.17 


W0362 


1234.58 


2592.98 


0.48 


W0362 


612.37 


471.85 


1.30 


w1927 


481.28 


1014.05 


0.47 


W1927 


726.43 


629.35 


1.15 


W0211 


639.90 


1353.69 


0.47 


W0211 


779.65 


724.87 


1.08 


W0237 


855.85 


1815.70 


0.47 


w0237 


853.25 


919.94 


0.93 


W2592 


585.96 


1247.13 


0.47 


W2592 


829.44 


1006.55 


0.82 


phnH 


987.67 


2146.99 


0.46 


phnH 


537.66 


1161.05 


0.46 


figA 


577.37 


1255.75 


0.46 


flgA 


656.68 


723.92 


0.91 


W2595 


377.12 


821.43 


0.46 


w2595 


800.14 


800.17 


1.00 


W2600 


792.47 


1732.89 


0.46 


W2600 


69.56 


216.88 


0.32 


pheL 


93.28 


205.15 


0.45 


pheL 


892.86 


431.26 


2,07 


W3049 


1146.62 


2524.70 


0,45 


W3049 


764.84 


318.13 


2.40 


W1031 


1045.43 


2305.88 


0.45 


W1031 


937.41 


1834.97 


0.51 


W0295 


473.49 


1044.97 


0.45 


W0295 


486.50 


540.31 


0.90 


marB 


872.85 


1928.83 


0.45 


marB 


587.05 


622.95 


0.94 


w0665 


786.69 


1744.63 


0.45 


w0665 


711.12 


397.24 


1.79 


W1016 


1399.58 


3109.10 


0.45 


W1016 


903.55 


1024.80 


0.88 


w0298 


8569.57 


19066.70 


0.45 


w0298 


570.42 


464.88 


1.23 


W0812 


1381.42 


3073.94 


0.45 


W0812 


1112.52 


446.12 


2.49 


w2026 


1456.45 


3270.04 


0.45 


W2026 


909.59 


321.83 


2.83 


W0715 


1050.65 


2361.56 


0.44 


W0715 


739.31 


960.38 


0.77 


pyrL 


689.56 


1552.58 


0.44 


pyrL 


1533.58 


1350.22 


1.14 


menE 


1802.56 


4058.69 


0.44 


menE 


38.16 


216.63 


0.18 


rnb 


703.05 


1594.10 


0.44 


rnb 


707.80 


1099.36 


0.64 


fucR 


1770.89 


4026.45 


0.44 


fucR 


973.74 


681.23 


1.43 


W2818 


278.71 


634.22 


0.44 


W2818 


603.99 


388.89 


1.55 


acpD 


1990.24 


4539.42 


0.44 


acpD 


610.87 


662.57 


0.92 


W0489 


332.52 


758.67 


0.44 


W0489 


144.96 


121.61 


1.19 


ppdA 


177.26 


405.05 


0.44 


ppdA 


439.04 


478.63 


0.92 


w1966 


488.90 


1118.27 


0.44 


W1966 


277.30 


324.71 


0.85 


no template 


333.36 


764.15 


0.44 


no 














template 


1245.39 


1015.95 


1.23 


W2401 


1085.45 


2497.91 


0.43 


W2401 


570.79 


872.26 


0.65 


w4094 


1505.90 


3477.91 


0.43 


W4094 


100.79 


255.85 


0.39 


dicC 


543.43 


1266.31 


0.43 


dicC 


1035.07 


1108.91 


0.93 


W0286 


844.03 


1986.63 


0.42 


W0286 


4510.92 


4439.04 


1.02 


selB 


341.91 


808.87 


0.42 


selB 


1625.34 


1568.78 


1.04 


w2733 


3589.01 


8496.95 


0.42 


w2733 
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187.58 


341.03 


0.55 


no template 


301.58 


716.01 


0.42 


no 
















template 


1841.41 


2180.01 


0.84 


trpC 


10007.85 


23940.58 


0.42 


trpC 


m15 


control 


final 


Gene name 


M40 


control 


total 


Gene 






ratio 








ratio 


name 


278.39 


137.91 


2,02 


relF 


449.81 


1077.54 


0.42 


relF 


791.31 


342.34 


2.31 


W1318 


1106.98 


2655.28 


0.42 


W1318 


224.27 


312.05 


0.72 


agaV 


202.28 


490.11 


0.41 


agaV 


791.41 


366.08 


2.16 


W1002 


1315.80 


3189.26 


0.41 


W1002 


890.33 


701.73 


1.27 


w0685 


1124.73 


2728.61 


0.41 


w0685 


622.65 


681.13 


0.91 


potH 


500.10 


1228.03 


0.41 


potH 


993.12 


584.36 


1.70 


W2399 


853.26 


2107.06 


0.40 


W2399 


1275.28 


924.58 


1.38 


metA 


991.35 


2449.36 


0.40 


metA 


146.54 


178.28 


0.82 


lytB 


310.99 


770.46 


0.40 


lytB 


419.51 


730.52 


0.57 


W2987 


638.04 


1593.08 


0.40 


w2987 


827.42 


695.16 


1.19 


W0552 


1225.71 


3084.40 


0.40 


W0552 


568.15 


412.40 


1.38 


W1846 


382.41 


968.10 


0.40 


W1846 


114.29 


119.19 


0.96 


dicB 


533.34 


1351.95 


0.39 


dicB 


842.23 


490.95 


1.72 


W1005 


970.38 


2465.94 


0.39 


W1005 


912.78 


720.90 


1.27 


W2587 


509.45 


1301.74 


0.39 


W2587 


1229.46 


751.44 


1.64 


W1260 


963.45 


2507.31 


0.38 


W1260 


1448.48 


1055.79 


1.37 


W3068 


12241.51 


31901.37 


0.38 


W3068 


1010.15 


949.49 


1.06 


w0551 


2143.72 


5619.44 


0.38 


W0551 


794.65 


573.71 


1.39 


W2599 


616.94 


1633.89 


0.38 


W2599 


905.72 


963.26 


0.94 


w0569 


912.64 


2444.12 


0.37 


W0569 


1708.81 


2679.92 


0.64 


fruR 


7146.34 


19274.42 


0.37 


fruR 


1170.58 


1168.59 


1.00 


W3927 


1323.78 


3637.02 


0.36 


W3927 


2894.91 


2291.60 


1.26 


w3069 


15445.74 


42484.53 


0.36 


W3069 


1162.90 


1058.22 


1.10 


W0162 


5614.02 


15712.25 


0.36 


w0162 


494.27 


467.07 


1.06 


w0564 


1311.37 


3713.19 


0.35 


w0564 


2542.41 


5907.71 


0.43 


lar 


367.19 


1045.32 


0.35 


lar 


145.60 


352.49 


0.41 


agaB 


174.76 


500.90 


0.35 


agaB 


360.63 


406.91 


0.89 


w0356 


893.83 


2593.46 


0.34 


W0356 


146.66 


228.04 


0.64 


ptrB 


583.62 


1710.66 


0.34 


ptrB 


89.69 


85.82 


1.05 


tdcA 


140.71 


420.20 


0.33 


tdcA 


1569.84 


1131.81 


1.39 


W0005 


854.54 


2591.31 


0.33 


w0005 


949.03 


723.77 


1.31 


W2820 


711:72 


2231.62 


0.32 


W2820 


382.29 


200.74 


1.90 


racC* 


145.56 


473.98 


0.31 


racC 


966.21 


528.97 


1.83 


w1323 


814!12 


2717.50 


0.30 


W1323 


2804.26 


3141.20 


0.89 


tolQ 


7715.84 


26048.34 


0.30 


tolQ 


349.28 


732.94 


0.48 


w0535 


238.50 


834.39 


0.29 


W0535 


19047.83 


10107.02 


1.88 


W2546 


13321.17 


46691.65 


0.29 


W2546 


580.46 


495.64 


1.17 


W0553 


734.83 


2584.37 


0.28 


W0553 


213.09 


433.84 


0.49 


W1426 


136.41 


525.28 


0.26 


W1426 


28409.79 


32349.40 


0.88 


glpT 


25094.68 


97868.64 


0.26 


gipT 


271.65 


813.15 


0.33 


sapC 


354.66 


1417.55 


0.25 


sapC 


1502.52 


1107.42 


1.36 


W2597 


631.01 


2727.88 


0.23 


W2597 


274.59 


176.87 


1.55 


ais 


706.69 


3093.84 


0.23 


ais 


191.80 


216.29 


0.89 


celA 


864.54 


3806.84 


0.23 


celA 


109.10 


59.92 


1.82 


ppdB 


38.01 


182.15 


0.21 


ppdB 


249.88 


204.52 


1.22 


agaC 


80.45 


386.08 


0.21 


agaC 


56.62 


13.68 


4.14 


hrpA 


546.07 


2814.78 


0.19 


hrpA 


182.61 


92.19 


1.98 


tdcR 


62.07 


330.94 


0.19 


tdcR 


5374.56 


5767.37 


0.93 


spoil 


10944.76 


60533.88 


0.18 


spoU 


456.55 


279.26 


1.63 


W0549 


956.45 


5470.42 


0.17 


w0549 


195.25 


128.74 


1.52 


agaW 


77.80 


464.51 


0.17 


agaW 


556.82 


343.19 


1.62 


W0548 


783.67 


4816.90 


0.16 


w0548 


177.02 


182.96 


0.97 


alpA 


37.14 


237.77 


0.16 


alpA 
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230.79 63.00 3.66 hybF | 85.60 616.14 0.14 hybF 
Gene names written in bold letters are SOS response genes;Ml5: 15 min exposure 
to MMC;M40: 40 min exposure to MMC 

Table 7. Gene expressions in DM800 and DM803 when exposed to MMC 

5 



Gene name 


b 


DM800 


control 


ratio 


DM803 


DM803 


ratio 




number 


MMC 






MMC 


control 




recN 




98536.3 


11895.5 


8.3 


2454.6 


2089.9 


1.2 


hsIS 




11287.9 


1677.7 


6.7 


426.0 


370.9 


1.1 


dinl 




10515.2 


1564.7 


6.7 


780.0 


516.8 


1.5 


sulA 




13936.2 


2416.8 


5.8 


1167.7 


801.5 


1.5 


W1816 


b1848 


18614.3 


3231.0 


5.8 - 


124.7 


900.3 


0.1 


hsIT 




21578.6 


5358.3 


4.0 


1486.7 


962.6 


1.5 


lexA 




22768.8 


6375.1 


3.6 


2935.3 


2950.2 


1.0 


w0522 


b0531 


8750.6 


2628.4 


3.3 


1299.3 


752.0 


1.7 


W1815 


b1847 


16206.3 


4874.5 


3.3 


236.0 


660.6 


0.4 


w2141 


b2181 


2156.1 


691.5 


3.1 ' 


0.0 


934.4 


0.0 


recA 




34375.8 


11386.6 


3.0 


8505.3 


6677.0 


1.3 


smpA 




1614.6 


558.5 


2.9 


591.3 


624.8 


0.9 


cspA 




12566.2 


4347.9 


2.9 


52966.2 


42356.6 


1.3 


ecpD 




227.5 


80.4 


2.8 


384.4 


287.9 


1.3 


w3019 


b3080 


802.6 


288.9 


2.8 


970.7 


621.6 


1.6 


W2878 


b2939 


1151.8 


439.8 


2.6 


0.0 


373.7 


0.0 


entD 




491.1 


189.4 


2.6 


4916 


328.8 


1.5 


fhuC 




818.1 


321.7 


2.5 


967.4 


712.6 


1.4 


W1201 


b1228 


4623.0 


1874.2 


2.5 , 


2598.5 


2567.4 


1.0 


W2999 


b3059 , 


1710.6 


709.1 ; 


2.4 


1279.4 


■i 1182.6 


1.1 


caiB 


i ■ 


326.5 


139:5 


' 2.3 


* 834.2 


806.6 


1.0 


infA 




8186.4 


3504.9 


2.3. 


'2964.1 


3046.5 


1.0 


uvrA 




33530.3 


14452.9 


2.3 t 


y 6731.4 !, 


5941.1 


1.1 


W2879 


b2940 


3946.5 


1727.3 


2.3 


, 1{ 281.8 


'> 1248.5 


0.2 


insB_2 




6522.2 


2894.3 


2.3 


4566.2 ;fl 


. 4307.9 


1.1 


dinD 




5316.7 


2385.6 


2.2 


} 3182.2 ;N 


2522.4 


1.3 


secG 




6754.2 


3076.6 


2.2 * 


9867.2 


: 8993.5 


1.1 


priC 




341.8 


156.1 


2.2 


,'1151.2 ' 


509.8 


2.3 


W0561 


b0571 


11989.7 


5479.6 


2.2 


>. 0.0 v 


636.5 


0.0 


exbD 




7165.4 


3289.3 


2.2 


1867.7 


1621.7 


1.2 


umuC 




5697.9 


2659.8 


2.1 


1014.3 ; 


i 908.3 


1.1 


mioC 




8113.1 


3804.3 


2.1 


3218.1 


2444.7 


1.3 


insBjl 




6065.0 


2854.2 


2.1 


4533.2 . 


4432.3 


1.0 


trkH 




17795.3 


8430.8 


2.1 


2293.2 . 


1959.1 


1.2 


W1345 


b1374 


1026.3 


487.6 


2.1 


0.0 


522.9 


0.0 


dniR 




1878.7 


900.0 


2.1 


1427.1 


899.9 


1.6 


uvrB 




14508.7 


6960.8 


2.1 


5425.4 


5179.7 


1.0 


insA_4 




2298.2 


1111.8 


2.1 


939.3 


1747.8 


0.5 


ruvA 




10492.5 


5134.4 


2.0 


1819.8 


1918.2 


0.9 


appY 




815.1 


400.3 


2.0 


593.1 


340.6 


1.7 


xseA 




13206.6 


6494.6 


2.0 


17733.8 


8677.0 


2.0 


W0224 


b0231 ; 


6686.9 


3310.7 


2.0, 


. 2353.4 


1760.8 


1.3 


W3139 


b3199 


11174.8 


5555.3 


2.0, : 


■.i; 2171.5 


3228.1 


0.7 


W2512 


b2559 


24511.0 


122Q7.1 


2.0 . 


? 1537.4 


1511.4 


1.0 


entF 


i •: 


179.8 


91.5 


2.0 v, 


494.1 


339.5 


1.5 


glnK 




408.6 


208,4 


2.0 : 


571.9 


> 212.8 


2.7 


insB_4 




6575.4 


3364.4 


2.0 


2823.1 


2738.2 


1.0 



.53 



rnpA 




pheP 




w0491 


b0500 


W4088 


b4183 


w0570 


b0580 


W0221 


b0228 


w1347 


b1376 


rem 




xylF 




W2284 


b2325 


W0627 


b0637 


nip 




w2940 


b3001 


W0591 


b0601 


W0270 


b0278 


W2790 


b2849 


pyrl 




w1018 


b1045 


W2709 


b2767 


W2329 


b2371 


aidB 




W2791 


b2850 


W2605 


b2659 


cmtB 




w3890 


b3975 


gntV 




aspA 




W3210 


b3268 


W0271 


b0279 


W1003 


b1029 


W1017 


b1044 


feoA 




chpS 




W0619 


b0629 


yjjM 


b4357 


W2816 


b2876 


W3272 


b3337 


W3443 


b3507 


W0351 


b0359 


W1527 


M556 


W0263 


b0271 


csgA 




fimZ 




W2314 


b2356 


W2596 


b2649 


yigj 


b3823 


W1025 


b1052 


W1359 


M388 


W0673 


b0690 


fixX 




W4093 


b4188 


w0674 


b0691 


W2400 


b2447 


W1963 


b2004 


W3974 


b4066 


W1242 


b1271 


W0525 


b0534 



5250.2 2686.3 

2467.2 1263.6 

508.2 925.1 

384.5 701.1 

1956.2 3578.9 

1612.4 2950.5 

5383.3 9852.6 
540.5 989.7 

2117.7 3885.5 

452.5 830.6 

1984.5 3647.0 

487.4 895.9 

3777.0 6949.2 

1658.1 3060.6 
1175.1 2170.8 
455.2 841.4 
1174.7 2171.6 

2359.6 4364.5 
1312.0 2440.9 

594.8 1107.2 
1218.9 2269.2 
773.2 1441.3 

6211.7 11600.3 

141.2 263.8 

824.3 1541.5 

1242.4 2324.3 
92135.1 173324.7 

479.3 903.6 

792.9 1496.9 
2145.7 4051.8 
1431.3 2703.5 

470.5 888l8 
1657.3 3141.1 
2054.3 3896.8 
1966.9 3732.7 
1687.7 3205.6 
587!3 1115.5 
446.9 849.1 

1435.0 2731.1 

253.0 482.4 
955.9 1824.5 
522.9 998.6 

536.1 1024.6 

581.8 111313 
370-7 710.0 

542.9 1039.7 

1003.1 1922 J 

2953.5 5661.9 
1558.5 2992.7 
313.0 601.3 

953.2 1839.0 
1466.3 2840.2 
1214.0 2355.8 

537.0 1043.7 

455.1 884.6 
3047.5 5958.6 
930.8 1822.0 



2.0 1722.3 

2.0 2974.0 

0.5 176.3 

0.5 777.9 

0.5 867.8 

0.5 326.2 

0.5 1194.3 

0.5 0.0 

0.5 4289.6 

0.5 894.8 

0.5 0.0 

0.5 1004.9 

0.5 2725.9 

0.5 602.0 

0.5 493.9 

0.5 489.3 

0.5 752.3 

0.5 525.2 

0.5 597.4 

0.5 861.9 

0.5 4969.4 

0.5 508.9 

0.5 3614.7 

0.5 275.0 

0.5 607.0 

0.5 311.4 

0.5 10420.0 

0.5 551.4 

0.5 471.2 

0.5 2.6 

0.5 1035.5 

0.5 1313.4 

0.5 568.1 

0.5 289.0 

0.5 2090.3 

0.5 1046.1 

0.5 430.3 

0.5 591.4 

0.5 0.0 

0.5 726.3 

0.5 484.0 

0.5 703.7 

0.5 424.3 

0.5 1114.7 

0.5 1024.4 

0:5 1372.3 

0.5 177.0 

0.5 328.5 

0.5 249.4 

0.5 1259.9 

0.5 901.6 

0.5 129.0 

0.5 810.8 

0.5 667.7 

0.5 816.0 

0.5 1870.6 

0.5 829.3 



2000.1 


0.9 


2187.8 


1.4 


96.9 


1.8 


583.0 


1.3 


197.6 


4.4 


420.6 


0.8 


1736.1 


0.7 


0.0 




3178.7 


1.3 


911.0 


1.0 


299.3 


0.0 


549.6 


1.8 


1950.7 


1.4 


402.6 


1.5 


428.3 


1.2 


455.5 


1.1 


509.1 


1.5 


272.8 


1.9 


383.4 


1.6 


690.9 


1.2 


3102,9 


1.6 


523.1 


1.0 


2885.4 


1.3 


164.4 


1.7 


403.6 


1.5 


446.1 


0.7 


16655.6 


0.6 


372.6 


1.5 


426.9 


1.1 


0.0 




413.5 


2.5 


1090.4 


1.2 


558.4 


1.0 


43.4 


6.7 


1457.0 


1.4 


858.8 


1.2 


359.9 


1.2 


598.3 


1.0 


72.1 


0.0 


421.9 


1.7 


330.6 


1.5 


599.6 


1.2 


343.7 


1.2 


731.5 


1.5 


605.2 


1.7 


817.5 


1.7 


9as 


2.0 


262.2 


1.3 


313.2 


0.8 


891.9 


1.4 


606.7 


1.5 


350.8 


0.4 


406.3 


2.0 


598.5 


1.1 


462.9 


1.8 


336.3 


5.6 


490.0 


1.7 



wo 



9261 




W4132 


54227 


w3304 


b3369 


w2228 


b2269 


chaB 




cydA 




meIR 




w1004 


b1030 


hofG 




w1429 


b1458 


w0299 


b0307 


rpiR 




celD 




w0801 


b0825 


w0241 


b0249 


w0621 


b0631 


putP 




w4099 


b4194 


prsA 




hybD 




sapB 




w3821 


b3901 


w1459 


b1488 


agaD 




ccmD 




cpsG 




re!B 




w2809 


b2869 


w0824 


b0848 


osmE 




w0362 


bO370 


W1927 


b1963 


W0211 


b0218 


W0237 


b0245 


w2592 




phnH 




flgA 




w2595 


b2648 


w2600 


b2654 


ph©L 




w3049 


b3107 


w1031 




W0295 


b0303 


marB 




w0665 


b0682 


w1016 


b1043 


w0298 


b0306 


w0812 


b0836 


w2026 


b2067 


w0715 


b0732 


DvrL 




msnE 




rnb 




fucR 




w2818 


b2878 


acpD 




w0489 


b0498 


ppdA 


i 


W1966 


b2007 



1602.0 31512 0.5 

428.9 850.8 0.5 

272.0 540.0 0.5 

721.8 1433.4 0.5 

699.9 1392.5 0.5 
5082.0 10140.2 0.5 
1265.7 2528.7 0.5 
315.2 630.0 0.5 

360.8 721.2 0.5 

9464.0 18943.1 0.5 

748.7 1502.6 0.5 
2009.3 4041.7 0.5 

2801.7 5636.8 0.5 

712.2 1434.3 0.5 

1293.5 2605.4 0.5 
12834.1 25859.7 0.5 

377.9 764.4 0.5 
535.4 1091.6 0.5 

177.8 363.8 0.5 

501.3 1027.1 0.5 

412.4 845.8 0.5 
417.7 857.7 0.5 
222.7 459.0 0.5 

646.1 1332.0 0.5 

1070.1 2212.1 0.5 

1002.3 207215 0.5 
3556.9 736513 0.5 

1242.8 2590.0 0.5 

2901.6 6072.9 0.5 
1234.6 2593.0 0.5 

481.3 1014.0 0.5 

639.9 1353.7 0.5 
855.9 1815.7 0.5 

586.0 1247.1 0.5 
987.7 2147.0 0.5 

577.4 1255.7 0.5 

377.1 821.4 0.5 

792.5 1732.9 0.5 
93.3 205.1 0.5 

1146.6 2524.7 0.5 

1045.4 2305.9 0.5 

473.5 1045.0 0.5 
872.9 1928.8 0.5 
786.7 1744.6 0.5 
1399.6 3109.1 0.5 

8569.6 19066.7 0.4 

1381.4 3073.9 0.4 

1456.5 3270.0 0.4 

1050.7 2361.6 0.4 

689.6 1552.6 0.4 

1802.6 4058.7 0.4 
703.1 1594.1 0.4 

1770.9 4026.5 0.4 

278.7 634.2 0.4 

1990.2 4539.4 0.4 
332.5 758.7 0.4 
177.3 405.'0 0.4 
488.9 1118.3 0.4 

in;;; 55 
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3125.1 


2509.8 


1.2 


728.3 


701.9 


1.0 


482.1 


399.3 


1.2 


177.0 


0.0 




963.9 


305.9 


3.2 


1511.6 


1528.2 


1.0 


27.8 


163.0 


0.2 


694.2 


767.7 


0.9 


711.8 


488.3 


1.5 


2048.3 


2846.3 


0.7 


536.3 


419.6 


1.3 


1094.3 


303.1 


3.6 


920.2 


417.5 


2.2 


772.1 


649.4 


1.2 


0.0 


170.6 


0.0 


5693.7 


6221.7 


0.9 


709.5 


539.0 


1.3 


24.5 


0.0 




510.9 


573.9 


0.9 


258.8 


0.0 




945.9 


539.7 


1.8 


994.9 


740.6 


1.3 


966.6 


465.2 


2.1 


92.3 


64.5 


1.4 


0.0 


8.9 


0.0 


0.0 


92.3 


0.0 


1264.0 


1453.8 


0.9 


370.9 


350.1 


1.1 


350.0 


900.3 


0.4 


0.0 


0.0 




824.4 


612.6 


1.3 


198.7 


321.6 


0.6 


355.4 


461.1 


0.8 


1032.5 


566.6 


1.8 


1103.3 


937.7 


1.2 


1482.3 


1396.3 


1.1 


647.4 


431.6 


1.5 


222.5 


309.6 


0.7 


256.4 


258.9 


1.0 


53.4 


180.9 


0.3 


188.9 


101.9 


1.9 


468.2 


461.2 


1.0 


843.0 


130.7 


6.5 


0.0 


592.3 


0.0 


419.6 


203.5 


2.1 


1722.6 


2401.5 


0.7 


356.8 


314.2 


1.1 


22.0 


337.9 


0.1 


154.8 


30.1 


5.1 


520.5 


338.5 


1.5 


4640.7 


928.2 


5.0 


399.4 


0.0 




2264.4 


1572.6 


1.4 


787.6 


511.9 


1.5 


0.0 


287.0 


0.0 


137.3 


183.4 


0.7 


662.6 


505.3 


1.3 


1163.2 


680.8 


1.7 
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no 




333.4 


764.2 


0.4 


470.2 


368.3 


1.3 


template" 
















w2401 


b2448 


1085.4 


2497.9 


0.4 


1160.0 


715.4 


1.6 


W4094 


b4189 


1505.9 


3477.9 


0.4 


1022.3 


932.0 


1.1 


dicC 




543.4 


1266.3 


0.4 


319.5 


67.2 


4.8 


W0286 


b0294 


844.0 


1986.6 


0.4 


498.7 


447.4 


1.1 


selB 




341.9 


808.9 


0.4 


7781.6 


7090.5 


1.1 


W2733 


b2789 


3589.0 


8496.9 


0.4 


5638.1 


3562.2 


1.6 


"no 




301.6 


716.0 


0.4 


408.2 


329.1 


1.2 


template" 
















trpC 




10007.8 


23940.6 


0.4 


421.9 


2562.2 


0.2 


relF 




449.8 


1077.5 


0.4 


0.0 


0.0 




W1318 


b1347 


1107.0 


2655.3 


0.4 


1.4 


308.5 


0.0 


agaV 




202.3 


490.1 


0.4 


673.0 


669.2 


1.0 


w1002 


b1028 


1315.8 


3189.3 


0.4 


288.3 


62.7 


4.6 


w0685 


b0702 


1124.7 


2728.6 


0.4 


198.6 


229.5 


0.9 


potH 




500.1 


1228.0 


0.4 


641.9 


243.8 


2.6 


W2399 


b2446 


853.3 


2107.1 


0.4 


770.2 


485.4 


1.6 


metA 




991.4 


2449.4 


0.4 


2026.6 


1816.4 


1.1 


lytB 




311.0 


770.5 


0.4 


353.9 


181.5 


1.9 


w2987 


b3047 


638.0 


1593.1 


0.4 


509.8 


303.9 


1.7 


W0552 


b0562 


1225J 


3084.4 


0.4 


0.0 


588.7 


0.0 


W1846 


b1878 


382.4 


968.1 


0.4 


1322.3 


1154.7 


1.1 


dicB 




533.3 


1351.9 


0.4 


260.1 


54.8 


4.7 


W1005 


b1031 


970.4 


2465.9 


0.4 


0.0 


106.0 


0.0 


w2587 


b2640 


509.5 


1301.7 


0.4 


557.1 


371.8 


1.5 


W1260 


b1289 


963.5 


2507.3 


0.4 


0.0 


353.6 


0.0 


w3068 


b3127 


12241.5 


319014 


0.4 


2383.9 


3185.5 


0.7 


W0551 


b0561 


2143.7 


5619.4 


0.4 


477.1 


864.7 


0,6 


W2599 


b2653 


616.9 


1633.9 


0.4 


456.5 


107.8 


4.2 


W0569 


b0579 


912.6 


2444.1 


0.4 


673.5 


612.3 


1.1 


fruR 




7146.3 


19274.4 


0.4 


2076.3 


2369.5 


0.9 


w3927 


b4020 


1323.8 


3637.0 


0.4 


2685.9 


2131.5 


1.3 


W3069 


b3128 


15445:/ 
5614.0 


42484.5 


0/4 


499.8 


1153.3 


0.4 


W0162 


b0162 


15712.2 


0.4 


4564.1 


3889.2 


1.2 


W0564 


b0574 


1311.4'. 


3713^2 


0.4 


584.5 


175.8 


3.3 


iar 




367.2 


1045*3 


0.4 


82.1 


122.0 


0.7 


agaB 




174.8 


500.9 


0.3 


371.8 


504.1 


0.7 


w0356 


b0364 


893.8 


2593.5 


0.3 


10.4 


32.6 


0.3 


ptrB 




583.6 


1710.7 


0.3 


246.4 


69.2 


3.6 


tdcA 




140.7 


420.2 


0.3 


612.3 


343.4 


1.8 


W0005 


b0005 


854.5 


2591.3 


0.3 


442.3 


388.2 


1.1 


W2820 


b2880 


711.7 


2231.6 


0.3 


904.4 


609.4 


1.5 


racC 




145.6 


474.0 


0.3 


167.4 


26.9 


6.2 


W1323 


b1352 


814.1 


2717.5 


0.3 


297.1 


235.6 


1.3 


,tolQ 




7715.8 


26048.3 


0.3 


1213.4 


1001.4 


1.2 


W0535 


b0545 


238.5 


834.4 


0.3 


114.4 


162.5 


0.7 


W2546 


b2597 


13321.2 


4669116 


0.3 


2567.6 


4164.9 


0.6 


W0553 


b0563 


734.8 


2584.4 


0.3 


0.0 


277.1 


0.0 


W1426 


b1455 


136.4 


525.3 


0.3 


672.5 


358.6 


1.9 


gipT 




25094.7 


97868.6 


0.3 


7769.7 


14648.6 


0.5 


sapC 




354.7 


1417.6 


0.3 


546.0 


0.0 




W2597 


b2650 


631.0 


2727.9 


0.2 


1000.0 


566.4 


1.8 


ais 




706.7 


3093.8 


0.2 


352.9 


114.6 


3.1 


celA 




864.5 


3806.8 


0.2 


0.0 


512.6 


0.0 


ppdB 




38.0 


182.1 


0.2 


399.9 


345.8 


1.2 


agaC 




80.5 


386.1 


0.2 


598.1 


602.8 


1.0 


hrpA 




546.1 


2814.8 


0.2 


0.0 


147.3 


0.0 
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tdcR 




62.1 


330.9 


0.2 


350.0 


232.7 


1.5 


spoU 




10944.8 


60533.9 


0.2 


4024.5 


3163.9 


1.3 


w0549 


b0559 


956.4 


5470.4 


0.2 


115.5 


287.2 


0.4 


agaW 




77.8 


464.5 


0.2 


479.0 


779.3 


0.6 


w0548 


b0558 


783.7 


4816.9 


0.2 


327.8 


85.0 


3.9 


alpA 




37.1 


237.8 


0.2 


366.3 


269.2 


1.4 


hybF 




85.6 


616.1 


0.1 


332.8 


456.6 


0.7 



Table 8. Gene expressions in DM800 and DM803 when exposed to MMC 



Gene 


P ff 


DM800- 


DM800- 


DM800 ratio 


DM803- 


DM803* 


DM803 ratio 


name 




MMC 


control 


(MMC/control) 


MMC 


control (MMC/contro 


cnab 




721.8 


1433.4 


0.5 


177.0 


0.0 


ii 


gusC 




1069.6 


A A 70 A 

1178.1 


0.9 


39.2 


0.0 




prsA 




535.4 


1091.6 


0.5 


24.5 


0.0 




rnb 




703.1 


1594.1 


0.4 


399.4 


0.0 




rspB 




987.2 


1520.0 


0.6 


480.4 


0.0 




sapB 




501.3 


A A 

1027.1 


0.5 


258.8 


0.0 




sapC 




354.7 


A A A T & 

1417.6 


0.3 


546.0 


0.0 




uxaB 




2051.4 


O if oo o 

2423.2 


0.8 


527.4 


0.0 




WU3o7 


D0375 


1010.4 


1827.8 


0.6 


179.0 


0.0 




WU492 


D0501 


7i7 

717.2 


769.2 


0.9 


85.3 


0.0 




W0521 


D0530 


425.3 b 


746.2 


0.6 


169.9 


0 0 




W0537 


b0547 


TA A A , t* 

711.1 i it 


; 851 .5* J' 


0.8 


;A 87.9 


0 0 




w0544 


b0554 


602.7 


' 558.1 > ' } 


1.1 


98.8 


' \J.\J 




w1114 


b1141 


1523.8 v 


1839.4 *' 


0.8 


30.1 


0 0 

\J.\J 




W0508 


b0517 


2160.9 .■ ', 


2528.0 


0.9 


480.7 


10 


457 Q 


w0541 


b0551 


542.2 


591.7 


0.9 


140.0 


7 fi 


1ft A 


moIR 




4939.4 ^ 


4515.9 


1.1 


5518.5 


51^ fi 


in 7 


w2514 


b2561 


7715.4 


7319.4 


1.1 


23955.8 


2W7 fi 


m 1 

I U. I 


moIR 




3903.5 


3852.3 


1.0 


2355.4 


94^ fi 


Q 7 


w1329 


b1358 


1909.4 


1872.1 


1.0 


649.0 


75 0 


ft fi 


w1381 


b1410 


2104.8 


1895.7 


1.1 


2603.5 


V1Q fi 


ft 4 


W1088 


b1 1 15 


3071.3 


4218.6 


0.7 


1543.9 


184 ft 


ft A 
o.*f 


asdX 




2030.3 


2282.9 


0.9 


1833.5 


227.0 


8 1 


tnrS 




277.1 


318.4 


0.9 


1498.9 


196.8 


7.6 


w0873 


b0898 


2210.6 


1636.1 ; 


1.4 


* 1001.4 


132.9 


7.5 


w0617 


b0627 


4834.3 


7145.9; 


0.7 


l 2171.5 


301.1 


7.2 


WjZOO 




41U/./ 


0439.8 


A O 

1.2 


9485.2 


1318.6 


7.2 


w0999 


b1025 


1233.1 


888.5 ' 


1.4 


199.5 


J28.3 


7.1 


rspA 




1862.1 


2002.0 ! !: 


0.9 


1297.2 


191.4 


6.8 


w0843 


b0867 


2593.1 


2957. 8 i - 


0.9 


1333.5 


197.9 


6.7 


W0619 


b0629 


2054.3 ! 


3896.8' 


0.5 


i' 289.0 


43.4 


6.7 


marB 




872.9 


1928.8 ' 


0.5 


843.0 


-130.7 


6.5 


W1128 


b1155 


865.4 


1384.1' 


0.6 


•157.3 


24.8 


6.3 


napH 




1548.4 


1771.5? 


0.9 


; 2445.8 


392.2 


6.2 


racC 




145.6 ' 


474.0 1 « 


0.3 


'167.4 


? 26.9 


6.2 


W1266 


b1295 


1834.3 


3178.5' ' 


0.6 


1111.8 


178.6 


6.2 


w2857 


b2918 


2013.5: 


2025.3 ) 


1.0 


2905.4 


475.0 


6.1 


pfkB 




5266.9 


4949.8 ' 


1.1 


1148.4 


188.9 


6.1 


w1309 


b1338 


3612.6 


4271.9 • 


0.8 


3943.8 


660.9 


6.0 


napG 




4522.3 


4522.5 ' 


1.0 


5243.5 


900.1 


5.8 


w1242 


M271 


3047.5 


5958.6 


0.5 


1870.6 


336.3 


5.6 


w0991 


b1017 


2292.7 


1982.0 


1.2 


564.2 


101.9 


5.5 


vsr 




5927.8 


3617.9 


1.6 


3758.3 


687.4 


5.5 
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W0996 


b1022 


954.0 


881.7 


1.1 


406.4 


74.4 


5.5 


W0957 


b0983 


1903.7 


1772.1 


1.1 


1280.4 


234.3 


5.5 


W1287 


M316 


4404.7 


4724.2 


0.9 


4813 


89.0 


5.4 


W1113 


M140 


2063.9 


2466.4 


0.8 


1674.5 


315.1 


5.3 


W1168 


b1195 


1068.7 


1662.1 


0.6 


105.1 


19.9 


5.3 


W0918 


b0943 


964.3 


1122.4 


0.9 


166.1 


319 


5.2 


hsIJ 




1701.0 


1353.6 


1.3 


952.9 


183.4 


5.2 


fdnH 




2916.7 


3353.8 


0.9 


1551.5 


299.3 


5.2 


w0715 


b0732 


1050.7 


2361.6 


0.4 


154.8 


30.1 


5.1 


menE 




1802.6 


4058.7 


0.4 


4640.7 


928,2 


5.0 


W0511 


b0520 


2664.2 


3109.8 


0.9 


2221.0 


452.5 


4.9 


W2508 


b2555 


3575.2 


2467.8 


1.4 


5237.2 


1078.9 


4.9 


pspC 




670.6 


809.3 


0.8 


370.0 


76.5 


4.8 


W0767 


b0791 


2562.9 


3359.1 


0.8 


2376.4 


497.9 


4.8 


dicC 


- - 


- 543.4 


1266.3 


0.4 


319:5 


67.2 


4.8 


dicB 




533.3 


1351.9 


0.4 


260.1 


54.8 


4.7 


chaC 




3102.1 


3008.4 


1,0 


1480.1 


314.6 


4.7 


w0387 


b0395 


2772.4 


3361.1 


0.8 


1676.5 


362.0 


4.6 


W0986 


M012 


1670.0 


1494.6 


1.1 


1627.0 


352.8 


4.6 


W1002 


M028 


1315.8 


3189.3 


0.4 


288.3 


62.7 


4.6 


W0570 


b0580 


1956.2 


3578.9 


0.5 


867.8 


197.6 


4.4 


W1972 


b2013 


2191.0 


2234.9 


1.0 


1878.6 


435.2 


4.3 


gatR 




16844.2 


12487.4 


1.3 


2559.5 


598.3 


4.3 


w2582 


b2634 


3769.3 


4821.9 


0.8 
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Table 9 

Most highly expressed genes in Synechocystis sp. PCC6803 in minimal growth media 

(BG11 +5mM glucose). 



Systematic 
Name 


Gene 


Function 


Transcript 
copy in total 
mRNA 
(Average 
copy=l) 


slr2051 


cpcG 


phycobilisome rod-core linker 
polypeptide CpcG 


64.91 


S1I1580 


cpcC 


phycocyanin associated linker protein 


22.71 


slr0447 


amiC 


negative aliphatic amidase regulator 


19.45 


sH1070 


tktA 


transketolase 


19.24 


S1I0018 


cbbA 


fructose- 1, 6-bisphosphate aldolase 


14.27 


sirOOIl 


rbcX 


ND* 


12.00 


ss!0563 


psaC 


photosystem I subunit Vll 


11.31 


s!rl655 


psaL 


photosystem I subunit XI . 


10.91 


sl!0819 


psaF 


photosystem I subunit III 


10.56 


SH1867 


psbA3 


photosystem H Dl protein t 


10.43 


SI11324 


atpF 


ATP synthase subunit b 


10.37 


sill 746 


rp!12 


SOS ribosomal protein LI 2 


10.13 


S1I1099 


tufA 


protein synthesis elongation factor Tu 


9.48 


slr0009 


rbcL 


ribulose bisphosphate carboxylase large 
subunit 


8.39 


slr0012 


rbcS 


ribulose bisphosphate carboxylase small 
subunit 


8.14 


SI11326 


atpA 


ATP synthase a subunit 


7.72 


slr!908 




ND* 


7.62 


sU1578 


cpcA 


phycocyanin a subunit 


7.60 


slr2067 


apcA 


allophycocyanin a chain 


7.51 


slr2052 




ND* iivi -i ■;• 


7.41 


sill 184 


ho 


heme oxygenase 


7.27 


ssl3437 


rpsl7 


30S ribosomal protein SI 7 


7.26 


slI1786 




hypothetical protein (ND*) 


7.16 


ssl0020 


.petF 


ferredoxin 


7.07 


stll812 


rps5 


3 OS ribosomal protein S5 


7.04 



* ND = 



not determined 
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Table 10 



Most highly induced genes in Synechocystis sp. PCC6803 in BG1 1 media containing 5 mm 
elucose. with 20 min of UV-B treatment at 20 uKS-lm-2 mtpncitv 



Systematic 
Name 


Gene 


Function 


Data 
/Control 


STD 


ssr2595 


nliB 


High light- inducible protein 


22.7 


4.7 


_ 1 1 C A A 

slrl544 




ND* 


15.5 


7.6 


slI0528 




ND* 


12.1 


3.9 


sll 1514 


hspl7 


small heat shock protein 


9.9 


3.9 


slr!687 


nblB 


phycobilisome degradation protein NblB 


8.2 


1.9 


slll483 




transforming growth factor induced protein 


7.8 


2.2 


SII2012 


rpoD 


RNA polymerase sigma factor 


6.3 


2.0 


ssll633 




CAB/ELIP/HLIP superfamily 


6.0 


1.0 


ssI2542 


hliA 


high light-inducible protein 


5.6 


1.6 


sll0846 




ND* 


4.7 


0.9 


Mil Q /*f 




INL/ 


' 4.7 


1 .8 


slrl604 


ftsH 


Chloroplast associated protease FtsH 


4.6 


1.9 


slr0320 




ND* 


4.5 


2.2 


sI10306 


rpoD 


RNA polymerase sigma factor 


4.4 


1.0 


s!r0228 


ftsH 


cell division protein FtsH 


4.3 


1.7 


slrl641 


dpB 


ClpB protein 


4.3 


1.1 


ssr2016 




ND* 


4.2 


2.2 


sll 1867 


psbA3 


photosystem II Dl protein 


4.1 


0.3 



i:' '*■* 



i" I 
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CLAIMS 

What is claimed is: 

I . A method for identifying gene expression changes within a bacterial 
species comprising: 



10 



15 



(a) 
(b) 
(c) 

(d) 

(e) 



20 



(f) 

I." 
(g) 



providing a comprehensive micro-array synthesized from DNA 
comprised in a bacterial species; 

generating a first set of labeled probes from bacterial RNA, the 
RNA isolated from the bacterial species of step (a); 
hybridizing the first set of labeled probes of step (b) to the 
comprehensive micro-array of step (a), wherein hybridization 
results in a detectable signal generated from the labeled probe; 
measuring the signal generated by the hybridization of the first 
set of labeled probe to the comprehensive micro-array of 
step (c); 

subjecting the bacterial species of step (a) to a gene expression 
altering condition whereby the gene expressioii profile of the 
bacterial species is altered to produce a modified bacterial 
species ; > ' ' \y'^iu ,* = ■ >. 

generating a second set of labeled probes from bacterial RNA, 
the RNA isolated from the modified bacterial species of step (e); 



25 



30 



35 



hybridizing the second, set of labeled probes of step (f) to the 
' ,y comprehensive micro-array of step (a), wherein hybridization 

results t in a detectable signal generated from the labeled probe; 

(h) measuring the signal generated by the hybridization of the 
second set of labeled probes to the comprehensive micro-array 
of step (g); and 

(i) comparing signal generated from the first hybridization to the 
signal generated from the second hybridization to identify gene 
expression changes within a bacterial species. 

2. A method for identifying gene expression changes within a bacterial 
species comprising: 

(a) providing a comprehensive micro-array synthesized from DNA 
comprised in a bacterial species ; 

(b) generating a first set of fluorescent cDNA from bacterial RNA, 
the RNA isolated from the bacterial species of step (a); 

(c) hybridizing the first set of fluorescent cDNA of step (b) to the 
comprehensive micro-array of step (a), wherein hybridization 
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10 



15 



20 



25 



30 



results in a detectable signal generated from the fluorescent 
cDNA; 

(d) measuring the signal generated by the hybridization of the first 
set of fluorescent cDNA to the comprehensive micro-array of 
step (c); 

(e) subjecting the bacterial species of step (a) to a gene expression 
altering condition whereby the gene expression profile of the 
bacterial species is altered to produce a modified bacterial 
species; 

(f) generating a second set of fluorescent cDNA from bacterial 
RNA, the RNA isolated from the modified bacterial species of 
step (e); 

(g) hybridizing the second set of fluorescent cDNA of step (f) to the 
comprehensive micro-array of step (a), wherein hybridization 
results in a detectable signal generated from the fluorescent 
cDNA; ; ; > 

(h) measuring the signal generated by the hybridization of the 
second set of fluorescent cDNA to the comprehensive micro- 
array of step (g); and 

(i) comparing signal generated from the first hybridization to the 
signal generated from the second hybridization to identify gene 
expression changes within a bacterial species . 



3. A method according to either Claim 1 or 2 wherein the bacterial 
species is selected from the group consisting of enteric bacteria, Bacillus, 
Acinetobacter, Streptomyces, Methylobacter, Pseudomonas, Rhodobacter and 
Synechocystis 

4. A method according to either Claim 1 or 2 wherein the signal 
generating label is selected from the group consisting of fluorescent moieties, 
chemiluminescent moieties, particles, enzymes, radioactive tags. 

5. A method according to Claim 4 wherein the signal generating label is 
a fluorescent moiety and is selected from the group consisting of cy3 and cy5. 

6. A method according to either Claim lor 2 wherein the comprehensive 
micro-array contains at least 75% of all open reading frames in the bacterial 
species. 

7. A method according to Claim 6 wherein the comprehensive micro- 
array contains from about 2000 to about 6000 open reading frames. 

8. A method according to either Claim 1 or 2 wherein the gene 
expression altering condition is ? selected from the group consisting of a condition 



I 
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altering the genotype of the bacterial species, a condition altering the growth of 
the bacterial species , exposure to mutagens , antibiotics, UV light, gamma-rays, 
x-rays, phage, macrophages, organic chemicals, inorganic chemicals, 
environmental pollutants, heavy metals, changes in temperature, changes in pH, 
5 conditions producing oxidative damage, DNA damage, anaerobiosis, depletion or 
addition of nutrients, addition of a growth inhibitor, and desiccation. 

9. A method for identifying gene expression changes within a genome 
comprising: 

(a) providing a comprehensive micro-array synthesized from DNA 
10 comprised in a prokaryotic or eukaryotic speices; 

(b) generating a control set of fluorescent cDNA from total or 
polyadenylated RNA, the RNA isolated from the species of 
step (a), the fluorescent cDNA comprising at least one first 
fluorescent label and at least one different second fluorescent 

15 label; 

(c) mixing the control set of fluorescent cDNA labeled with the at 
least one first label with the control set of fluorescent cDNA 
labeled with the at least second first label to for a dual labeled 
control cDNA; 

20 (d) hybridizing the dual labeled control set of fluorescent cDNA of 

step (c) to the comprehensive micro-array of step (a), wherein 
hybridization results in a detectable signal generated from the 
fluorescent cDNA; f .. ■ ■ , ; M t i 

(e) measuring the signal generated by the hybridization of the dual 
25 labeled control set of fluorescent cDNA to the comprehensive 

micro-array of step (c); 

(f) subjecting the prokaryote or eukaryote of step (a) to a gene 
expression altering condition whereby the gene expression 
profile of the prokaryote or eukaryote is altered to produce a 

30 modified prokaryote or eukaryote ; 

(g) generating an experimental set of fluorescent cDNA from total 
or polyadenylated RNA, the RNA isolated from the modified 
prokaryote or eukaryote of step (e), the fluorescent cDNA 
comprising the first fluorescent label and the different second 

35 fluorescent label to step (b); ^ M | 

(h) mixing the experimental set of fluorescent cDNA labeled with 
the at least one first f label with the experimental set of 
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fluorescent cDNA labeled with the at least second first label to 
form a dual labeled experimental cDNA; 

(i) hybridizing the experimental set of fluorescent cDNA of 

step (h) to the comprehensive micro-array of step (a), wherein 
5 hybridization results in a detectable signal generated from the 

fluorescent cDNA; 

0) measuring the signal generated by the hybridization of the 
second set of fluorescent cDNA to the comprehensive micro- 
array of step (g); and 
10 (k) comparing signal generated from the dual labeled control 

hybridization with the dual labeled experimental hybridization 
to identify gene expression changes within a prokaryotic or 
eukaryotic species. 

1 0. A method according to Claim 9 wherein the first fluorescent label and 
15 the second fluorescent label is independently selected from the group consisting of 

cy3 and cy5. 

11. A method according to Claim 9 wherein the prokaryotic or eukaryotic 
genome is comprised within an organism selected from the group consisting of 
enteric bacteria, Bacillus, Acinetobacter, Streptomyces, Methylobacter, 

20 Pseudomona, cyanobacteria, yeasts, filamentous fungi, plant cells and animal 
cells. 

12. A method according to Claim 1 1 wherein yeast are selected from the 
group consisting of Saccharomyces, Zygosaccharomyces, Kluyveromyces, 
Candida, Hartsenula, Debaryomyces, Mucor, Pichia and Torulopsis. 

25 1 3 . A method according to Claim 1 1 wherein cyantobacteria are selected 

from the group consisting of Rhodobacter and Synechocystis. 

14. A method according to Claim 1 1 wherein filamentous fungi are 
selected from the group consisting of Aspergillus and Arthrobotrys. 

15. A method for quantitating the amount of protein specifying UNA 
30 contained within a genome comprising: .* i : r ; Vp i ;f ?. ! 

(a) providing a comprehensive micro-array comprising a 
multiplicity of open reading frames synthesized from genomic 
DNA comprised inja! prokaryotic or eukaryotic organism; 

(b) generating a set of fluorescent cDNA from total or poly- 

35 adenylated RNA isolated from the prokaryotic or eukaryotic 

organism of step (a); 

(c) generating a set of fluorescent DNA from genomic DNA 
isolated from the prokaryotic or eukaryotic organism of step (a); 
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(d) hybridizing the fluorescent cDNA of step (b) to the 

comprehensive micro-array of step (a), wherein hybridization 
results in a first fluorescent signal generated from the fluorescent 
cDNA for each open reading frame; 
5 (e) hybridizing the fluorescent DNA of step (c) to the 

comprehensive micro-array of step (a), wherein hybridization 
results in a second fluorescent signal generated from the 
fluorescent DNA for each open reading frame; and 
(f) dividing, for each open reading from , the first fluorescent signal 
10 _ into. the second fluorescent signal to provide a quantitated 

measure of the amount of protein specifying RNA for each open 
reading frame. 

16. A method for quantitating the amount of protein specifying RNA 
contained within a genome comprising: 

15 (a) providing a comprehensive micro-array comprising a 

multiplicity of genes synthesized from genomic DNA 
comprised in a prokaryotic or eukaryotic organism; 

(b) generating a set of fluorescent cDNA from total or poly- 
adenylated RNA isolated from the prokaryotic or eukaryotic 

20 organism of step (a); 

(c) generating a set of fluorescent DNA from genomic DNA 
isolated from the prokaryotic or eukaryotic organism of step (a); 

(d) hybridizing the fluorescent cDNA of step (b) to the 
comprehensive micrd-array of step (a), wherein hybridization 

25 results in a first flubrescent : signal generated from the 

fluorescent cDNA for each gene; 

(e) hybridizing the fluorescent DNA of step (c) to the 
comprehensive micro-array of step (a), wherein hybridization 
results in a second fluorescent signal generated from the 

30 fluorescent DNA for each gene; and ■ * : ' 

(f) dividing, for each open reading from , the first fluorescent signal 
into the second fluorescent signal to provide a quantitated 
measure of the amount of protein specifying RNA for each gene. 

17. A method for identifying gene expression changes within a bacterial 
35 species according to either Claim 1 or 2 providing for quantitating the amount of 

protein specifying RNA contained within a genome according to either Claim 1 5 
or 16. 

■ • ,. ; ■ M 
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1 8. A method for identifying gene expression changes within a genome 
according to Claim 8 providing for quantitating the amount of protein specifying 
RNA contained within a genome according to Claim 15 or 16. 
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